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Preface for Students 


Quantum field theory is the basic mathematical language that is used to 
describe and analyze the physics of elementary particles. The goal of this 
book is to provide a concise, step-by-step introduction to this subject, one 
that covers all the key concepts that are needed to understand the Standard 
Model of elementary particles, and some of its proposed extensions. 

In order to be prepared to undertake the study of quantum field theory, 
you should recognize and understand the following equations: 

a'\n ) = v/n+1 |n+l) 

J±\j,m) = V j(j+l)-m(m±l) \j, m± 1} 

A(t) = e + iHt / h Ae~ iHt / n 
H = pq — L 
ct' = r /(ct — fix) 

E = (p 2 c 2 + m 2 c 4 ) 1/2 
E = — A/c — \7tp 

This list is not, of course, complete; but if you are familiar with these 
equations, you probably know enough about quantum mechanics, classical 
mechanics, special relativity, and electromagnetism to tackle the material 
in this book. 

Quantum field theory has a reputation as a subject that is hard to 
learn. The problem, I think, is not so much that its basic ingredients are 
unusually difficult to master (indeed, the conceptual shift needed to go 
from quantum mechanics to quantum held theory is not nearly as severe 
as the one needed to go from classical mechanics to quantum mechanics), 
but rather that there are a lot of these ingredients. Some are fundamental, 
but many are just technical aspects of an unfamiliar form of perturbation 
theory. 

In this book, I have tried to make the subject as accessible to beginners 
as possible. There are three main aspects to my approach. 

Logical development of the basic concepts. This is, of course, very differ- 
ent from the historical development of quantum held theory, which, like the 
historical development of most worthwhile subjects, was filled with inspired 
guesses and brilliant extrapolations of sometimes fuzzy ideas, as well as its 
fair share of mistakes, misconceptions, and dead ends. None of that is in 
this book. From this book, you will (I hope) get the impression that the 
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whole subject is effortlessly clear and obvious, with one step following the 
next like sunshine after a refreshing rain. 

Illustration of the basic concepts with the simplest examples. In most 
fields of human endeavor, newcomers are not expected to do the most de- 
manding tasks right away. It takes time, dedication, and lots of practice to 
work up to what the accomplished masters are doing. There is no reason to 
expect quantum field theory to be any different in this regard. Therefore, 
we will start off analyzing quantum field theories that are not immediately 
applicable to the real world of electrons, photons, protons, etc., but that 
will allow us to gain familiarity with the tools we will need, and to practice 
using them. Then, when we do work up to “real physics” , we will be fully 
ready for the task. To this end, the book is divided into three parts: Spin 
Zero, Spin One Half, and Spin One. The technical complexities associated 
with a particular type of particle increase with its spin. We will therefore 
first learn all we can about spinless particles before moving on to the more 
difficult (and more interesting) nonzero spins. Once we get to them, we 
will do a good variety of calculations in (and beyond) the Standard Model 
of elementary particles. 

User friendliness. Each of the three parts is divided into numerous sec- 
tions. Each section is intended to treat one idea or concept or calculation, 
and each is written to be as self-contained as possible. For example, when 
an equation from an earlier section is needed, I usually just repeat it, rather 
than ask you to leaf back and find it (a reader’s task that I’ve always found 
annoying). Furthermore, each section is labeled with its immediate pre- 
requisites, so you can tell exactly what you need to have learned in order 
to proceed. This allows you to construct chains to whatever material may 
interest you, and to get there as quickly as possible. 

That said, I expect that most readers of this book will encounter it as 
the textbook in a course on quantum field theory. In that case, of course, 
your reading will be guided by your professor, who I hope will find the 
above features useful. If, however, you are reading this book on your own, 
I have two pieces of advice. 

The first (and most important) is this: find someone else to read it with 
you. I promise that it will be far more fun and rewarding that way; talking 
about a subject to another human being will inevitably improve the depth 
of your understanding. And you will have someone to work with you on 
the problems. (As with all physics texts, the problems are a key ingredient. 
I will not belabor this point, because if you have gotten this far in physics, 
you already know it well.) 

The second piece of advice echoes the novelist and Nobel laureate 
William Faulkner. An interviewer asked, “Mr. Faulker, some of your read- 
ers claim they still cannot understand your work after reading it two or 
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three times. What approach would you advise them to adopt?” Faulkner 
replied, “Read it a fourth time.” 

That’s my advice here as well. After the fourth attempt, though, you 
should consider trying something else. This is, after all, not the only book 
that has ever been written on the subject. You may find that a different 
approach (or even the same approach explained in different words) breaks 
the logjam in your thinking. There are a number of excellent books that 
you could consult, some of which are listed in the Bibliography. I have also 
listed particular books that I think could be helpful on specific topics in 
Reference Notes at the end of some of the sections. 

This textbook (like all finite textbooks) has a number of deficiencies. 
One of these is a rather low level of mathematical rigor. This is partly en- 
demic to the subject; rigorous proofs in quantum field theory are relatively 
rare, and do not appear in the overwhelming majority of research papers. 
Even some of the most basic notions lack proof; for example, currently 
you can get a million dollars from the Clay Mathematics Institute simply 
for proving that nonabelian gauge theory actually exists and has a unique 
ground state. Given this general situation, and since this is an introductory 
book, the proofs that we do have are only outlined, those proofs that we 
do have are only outlined. 

Another deficiency of this book is that there is no discussion of the 
application of quantum field theory to condensed matter physics, where 
it also plays an important role. This connection has been important in 
the historical development of the subject, and is especially useful if you 
already know a lot of advanced statistical mechanics. I do not want this 
to be a prerequisite, however, and so I have chosen to keep the focus on 
applications within elementary particle physics. 

Yet another deficiency is that there are no references to the original 
literature. In this regard, I am following a standard trend: as the foun- 
dations of a branch of science retreat into history, textbooks become more 
and more synthetic and reductionist. For example, it is now rare to see a 
new textbook on quantum mechanics that refers to the original papers by 
the famous founders of the subject. For guides to the original literature 
on quantum field theory, there are a number of other books with extensive 
references that you can consult; these include Peskin & Schroeder , Wein- 
berg, and Siegel. (Italicized names refer to works listed in the Bibliography.) 
Unless otherwise noted, experimental numbers are taken from the Review 
of Particle Properties, available online at http://pdg.lbl.gov. Experimen- 
tal numbers quoted in this book have an uncertainty of roughly ±1 in the 
last significiant digit. The Review should be consulted for the most recent 
experimental results, and for more precise statements of their uncertainty. 

To conclude, let me say that you are about to embark on a tour of one of 
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humanity’s greatest intellectual endeavors, and certainly the one that has 
produced the most precise and accurate description of the natural world as 
we find it. I hope you enjoy the ride. 
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Preface for Instructors 

On learning that a new text on quantum field theory has appeared, one is 
surely tempted to respond with Isidor Rabi’s famous comment about the 
muon: “Who ordered that?' After all, many excellent textbooks on quan- 
tum field theory are already available. I, for example, would not want to be 
without my well-worn copies of Quantum Field Theory by Lowell S. Brown 
(Cambridge 1994), Aspects of Symmetry by Sidney Coleman (Cambridge 
1985), Introduction to Quantum Field Theory by Michael E. Peskin and 
Daniel V. Schroeder (Westview 1995), Field Theory: A Modern Primer by 
Pierre Rarnond (Addison- Wesley 1990), Fields by Warren Siegel (arXiv.org 
2005), The Quantum Theory of Fields , Volumes I, II, and III, by Steven 
Weinberg (Cambridge 1995), and Quantum Field Theory in a Nutshell by 
my colleague Tony Zee (Princeton 2003), to name just a few of the more 
recent texts. Nevertheless, despite the excellence of these and other books, 
I have never followed any of them very closely in my twenty years of on- 
and-off teaching of a year-long course in relativistic quantum field theory. 

As discussed in the Preface for Students, this book is based on the no- 
tion that quantum field theory is most readily learned by starting with the 
simplest examples and working through their details in a logical fashion. 
To this end, I have tried to set things up at the very beginning to antici- 
pate the eventual need for renormalization, and not be cavalier about how 
the fields are normalized and the parameters defined. I believe that these 
precautions take a lot of the “hocus pocus” (to quote Feynman) out of the 
“dippy process” of renormalization. Indeed, with this approach, even the 
anharmonic oscillator is in need of renormalization; see problem 14.7. 

A field theory with many pedagogical virtues is (p 3 theory in six di- 
mensions, where its coupling constant is dimensionless. Perhaps because 
six dimensions used to seem too outre (though today’s prospective string 
theorists don’t even blink), the only introductory textbook I know of that 
treats this model is Quantum Field Theory by George Sterman (Cambridge 
1993), though it is also discussed in some more advanced books, such as 
Renormalization by John Collins (Cambridge 1984) and Foundations of 
Quantum Chromodynamics by T. Muta (World Scientific 1998). (There is 
also a series of lectures by Ed Witten on quantum field theory for math- 
ematicians, available online, that treat (p 3 theory.) The reason p 3 theory 
in six dimensions is a nice example is that its Feynman diagrams have a 
simple structure, but still exhibit the generic phenomena of renormalizable 
quantum field theory at the one-loop level. (The same cannot be said for ip 4 
theory in four dimensions, where momentum-dependent corrections to the 
propagator do not appear until the two-loop level.) Thus, in Part I of this 
text, p 3 theory in six dimensions is the primary example. I use it to give 
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introductory treatments of most aspects of relativistic quantum field theory 
for spin-zero particles, with a minimum of the technical complications that 
arise in more realistic theories (like QED) with higher-spin particles. 

Although I eventually discuss the Wilson approach to renormalization 
and effective field theory (in section 29), and use effective field theory exten- 
sively for the physics of hadrons in Part III, I do not feel it is pedagogically 
useful to bring it in at the very beginning, as is sometimes advocated. The 
problem is that the key notion of the decoupling of physical processes at dif- 
ferent length scales is an unfamiliar one for most students; there is nothing 
in typical courses on quantum mechanics or electomagnetism or classical 
mechanics to prepare students for this idea (which was deemed worthy of a 
Nobel Prize for Ken Wilson in 1982). It also does not provide for a simple 
calculational framework, since one must deal with the infinite number of 
terms in the effective lagrangian, and then explain why most of them don’t 
matter after all. It’s noteworthy that Wilson himself did not spend a lot 
of time computing properly normalized perturbative S'-matrix elements, a 
skill that we certainly want our students to have; we want them to have 
it because a great deal of current research still depends on it. Indeed, the 
vaunted success of quantum field theory as a description of the real world is 
based almost entirely on our ability to carry out these perturbative calcula- 
tions. Studying renormalization early on has other pedagogical advantages. 
With the Nobel Prizes to Gerard ’t Hooft and Tini Veltman in 1999 and to 
David Gross, David Politzer, and Frank Wilczek in 2004, today’s students 
are well aware of beta functions and running couplings, and would like to 
understand them. I find that they are generally much more excited about 
this (even in the context of toy models) than they are about learning to 
reproduce the nearly century-old tree-level calculations of QED. And <£> 3 
theory in six dimensions is asymptotically free, which ultimately provides 
for a nice segue to the “real physics” of QCD. 

In general I have tried to present topics so that the more interesting as- 
pects (from a present-day point of view) come first. An example is anoma- 
lies; the traditional approach is to start with the 7 r° — ► 77 decay rate, 
but such a low-energy process seems like a dusty relic to most of today’s 
students. I therefore begin by demonstrating that anomalies destroy the 
self-consistency of the great majority of chiral gauge theories, a fact that 
strikes me (and, in my experience, most students) as much more interest- 
ing and dramatic than an incorrect calculation of the 7 r° decay rate. Then, 
when we do eventually get to this process (in section 90), it appears as a 
straightforward consequence of what we already learned about anomalies 
in sections 75-77. 

Nevertheless, I want this book to be useful to those who disagree with 
my pedagogical choices, and so I have tried to structure it to allow for 
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maximum flexibility. Each section treats a particular idea or concept or 
calculation, and is as self-contained as possible. Each section also lists 
its immediate prerequisites, so that it is easy to see how to rearrange the 
material to suit your personal preferences. 

In some cases, alternative approaches are developed in the problems. 
For example, I have chosen to introduce path integrals relatively early 
(though not before canonical quantization and operator methods are ap- 
plied to free- field theory), and use them to derive Dyson’s expansion. For 
those who would prefer to delay the introduction of path integrals (but since 
you will have to cover them eventually, why not get it over with?), problem 
9.5 outlines the operator-based derivation in the interaction picture. 

Another point worth noting is that a textbook and lectures are ideally 
complementary. Many sections of this book contain rather tedious mathe- 
matical detail that I would not and do not write on the blackboard during 
a lecture. (Indeed, the earliest origins of this book are supplementary notes 
that I typed up and handed out.) For example, much of the development of 
Weyl spinors in sections 34-37 can be left to outside reading. I do encour- 
age you not to eliminate this material entirely, however; pedagogically, the 
problem with skipping directly to four-component notation is explaining 
that (in four dimensions) the hermitian conjugate of a left-handed field is 
right handed, a deeply important fact that is the key to solving problems 
such as 36.5 and 83.1, which are in turn vital to understanding the struc- 
ture of the Standard Model and its extensions. A related topic is computing 
scattering amplitudes for Majorana fields; this is essential for modern re- 
search on massive neutrinos and supersymmetric particles, though it could 
be left out of a time-limited course. 

While I have sometimes included more mathematical detail than is ideal 
for a lecture, I have also tended to omit explanations based on “physical 
intuition.” For example, in section 90, we compute the t~vn decay 

amplitude (where £ is a charged lepton) and find that it is proportional to 
the lepton mass. There is a well-known heuristic explanation of this fact 
that goes something like this: “The pion has spin zero, and so the lepton 
and the antineutrino must emerge with opposite spin, and therefore the 
same helicity. An antineutrino is always right-handed, and so the lepton 
must be as well. But only the left-handed lepton couples to the W~, so 
the decay amplitude vanishes if the left- and right-handed leptons are not 
coupled by a mass term.” 

This is essentially correct, but the reasoning is a bit more subtle than 
it first appears. A student may ask, “Why can’t there be orbital angular 
momentum? Then the lepton and the antineutrino could have the same 
spin.” The answer is that orbital angular momentum must be perpendicular 
to the linear momentum, whereas helicity is (by definition) parallel to the 
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linear momentum; so adding orbital angular momentum cannot change the 
helicity assignments. (This is explored in a simplified model in problem 
48.4.) The larger point is that intuitive explanations can almost always be 
probed more deeply. This is fine in a classroom, where you are available to 
answer questions, but a textbook author has a hard time knowing where 
to stop. Too little detail renders the explanations opaque, and too much 
can be overwhelming; furthermore the happy medium tends to differ from 
student to student. The calculation, on the other hand, is definitive (at 
least within the framework being explored, and modulo the possibility of 
mathematical error). As Roger Penrose once said, “The great thing about 
physical intuition is that it can be adjusted to fit the facts.” So, in this 
book, I have tended to emphasize calculational detail at the expense of 
heuristic reasoning. Lectures should ideally invert this to some extent. 

I should also mention that a section of the book is not intended to 
coincide exactly with a lecture. The material in some sections could easily 
be covered in less than an hour, and some would clearly take more. My 
approach in lecturing is to try to keep to a pace that allows the students to 
follow the analysis, and then try to come to a more-or-less natural stopping 
point when class time is up. This sometimes means ending in the middle 
of a long calculation, but I feel that this is better than trying to artificially 
speed things along to reach a predetermined destination. 

It would take at least three semesters of lectures to cover this entire 
book, and so a year-long course must omit some. A sequence I might 
follow is 1-23, 26-28, 33-43, 45-48, 51, 52, 54-59, 62-64, 66-68, 24, 69, 70, 
44, 53, 71-73, 75-77, 30, 32, 84, 87-89, 29, 82, 83, 90, and, if any time was 
left, a selection of whatever seemed of most interest to me and the students 
of the remaining material. 

To conclude, I hope you find this book to be a useful tool in working to- 
wards our mutual goal of bringing humanity’s understanding of the physics 
of elementary particles to a new audience. 



16 


Acknowledgments 

Every book is a collaborative effort, even if there is only one author on 
the title page. Any skills I may have as a teacher were first gleaned as a 
student in the classes of those who taught me. My first and most important 
teachers were my parents, Casimir and Helen Srednicki, to whom this book 
is dedicated. In our small town in Ohio, my excellent public-school teachers 
included Esta Kefauver, Marie Casher, Carol Baird, Jim Chase, Joe Gerin, 
Hugh Laughlin, and Tom Murphy. In college at Cornell, Don Hartill, Bruce 
Kusse, Bob Siemann, John Kogut, and Saul Teukolsky taught particularly 
memorable courses. In graduate school at Stanford, Roberto Peccei gave 
me my first exposure to quantum field theory, in a superb course that 
required bicycling in by 8:30 AM (which seemed like a major sacrifice at 
the time). Everyone in that class very much hoped that Roberto would one 
day turn his extensive hand-written lecture notes (which he put on reserve 
in the library) into a book. He never did, but I’d like to think that perhaps 
a bit of his consummate skill has found its way into this text. I have also 
used a couple of his jokes. 

My thesis advisor at Stanford, Lenny Susskind, taught me how to think 
about physics without getting bogged down in the details. This book in- 
cludes a lot of detail that Lenny would no doubt have left out, but while 
writing it I have tried to keep his exemplary clarity of thought in mind as 
something to strive for. 

During my time in graduate school, and subsequently in postdoctoral 
positions at Princeton and CERN, and finally as a faculty member at UC 
Santa Barbara, I was extremely fortunate to be able to interact with many 
excellent physicists, from whom I learned an enormous amount. These 
include Stuart Freedman, Eduardo Fradkin, Steve Shenker, Sidney Cole- 
man, Savas Dimopoulos, Stuart Raby, Michael Dine, Willy Fischler, Curt 
Callan, David Gross, Malcolm Perry, Sam Trieman, Arthur Wightman, Ed 
Witten, Hans-Peter Nilles, Daniel Wyler, Dmitri Nanopoulos, John Ellis, 
Keith Olive, Jose Fulco, Ray Sawyer, John Cardy, Frank Wilczek, Jim 
Hartle, Gary Horowitz, Andy Strominger, and Tony Zee. I am especially 
grateful to my Santa Barbara colleagues David Berenstein, Steve Giddings, 
Don Marolf, Joe Polchinski, and Bob Sugar, who used various drafts of this 
book while teaching quantum field theory, and made various suggestions 
for improvement. 

I am also grateful to physicists at other institutions who read parts of the 
manuscript and also made suggestions, including Oliver de Wolfe, Marcelo 
Gleiser, Steve Gottlieb, Arkady Tsetlyn, and Arkady Vainshtein. I must 
single out for special thanks Professor Heidi Fearn of Cal State Fullerton, 
whose careful reading of Parts I and II allowed me to correct many unclear 



17 


passages and outright errors that would otherwise have slipped by. 

Students over the years have suffered through my varied attempts to 
arrive at a pedagogically acceptable scheme for teaching quantum field 
theory. I thank all of them for their indulgence. I am especially grateful 
to Sam Pinansky, Tae Min Hong, and Sho Yaida for their diligence in 
finding and reporting errors, and to Brian Wignal for help with formatting 
the manuscript. Also, a number of students from around the world (as 
well as Santa Barbara) kindly reported errors in versions of this book that 
were posted online; these include Omri Bahat-Treidel, Hee-Joong Chung, 
Yevgeny Kats, Sue Ann Koay, Peter Lee, Nikhil Jayant Joshi, Kevin Weil, 
Dusan Simic, and Miles Stoudenmire. I thank them for their help, and 
apologize to anyone that I may have missed. 

Throughout this project, the assistance and support of my wife Eloise 
and daughter Julia were invaluable. Eloise read through the manuscript 
and made suggestions that often clarified the language. Julia offered advice 
on the cover design (a highly stylized Feynman diagram). And they both 
kindly indulged the amount of time I spent working on this book that you 
now hold in your hands. 



Part I 

Spin Zero 



1: Attempts at relativistic quantum mechanics 


19 


1 Attempts at relativistic quantum 

MECHANICS 
Prerequisite: none 

In order to combine quantum mechanics and relativity, we must first un- 
derstand what we mean by “quantum mechanics” and “relativity”. Let us 
begin with quantum mechanics. 

Somewhere in most textbooks on the subject, one can find a list of the 
“axioms of quantum mechanics” . These include statements along the lines 
of 


The state of the system is represented by a vector in Hilbert 
space. 

Observables are represented by hermitian operators. 

The measurement of an observable yields one of its eigenvalues 
as the result. 

And so on. We do not need to review these closely here. The axiom we 
need to focus on is the one that says that the time evolution of the state of 
the system is governed by the Schrodinger equation, 

ih^i\if,t) = , ( 1 . 1 ) 

where H is the hamiltonian operator, representing the total energy. 

Let us consider a very simple system: a spinless, nonrelativistic particle 
with no forces acting on it. In this case, the hamiltonian is 

H = — P 2 , (1.2) 

2m V ’ 

where m is the particle’s mass, and P is the momentum operator. In the 

position basis, eq. (1.1) becomes 

= ~V 2 ^(pc,t) , (1.3) 

where ip(x,t) = (x| if,t) is the position-space wave function. We would like 
to generalize this to relativistic motion. 

The obvious way to proceed is to take 


H = + v P 2 c 2 + m 2 c 4 


(1.4) 
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which yields the correct relativistic energy-momentum relation. If we for- 
mally expand this hamiltonian in inverse powers of the speed of light c, we 
get 

H = me 2 - 1 — — P 2 + . . . . (1.5) 

2m 

This is simply a constant (the rest energy), plus the usual nonrelativistic 
hamiltonian, eq. (1.2), plus higher-order corrections. With the hamiltonian 
given by eq. (1.4), the Schrodinger equation becomes 

= +\! — ft 2 c 2 V 2 +m 2 (A 4>Cx,t) . (1.6) 

ot. 


Unfortunately, this equation presents us with a number of difficulties. One 
is that it apparently treats space and time on a different footing: the time 
derivative appears only on the left, outside the square root, and the space 
derivatives appear only on the right, under the square root. This asymme- 
try between space and time is not what we would expect of a relativistic 
theory. Furthermore, if we expand the square root in powers of V 2 , we get 
an infinite number of spatial derivatives acting on ^(x, f); this implies that 
eq. (1.6) is not local in space. 

We can alleviate these problems by squaring the differential operators 
on each side of eq. (1.6) before applying them to the wave function. Then 
we get 

= ( -h 2 c 2 v 2 +m 2 c 4 )y>(x, t) . (1.7) 


This is the Klein-Gordon equation , and it looks a lot nicer than eq. (1.6). 
It is second-order in both space and time derivatives, and they appear in a 
symmetric fashion. 

To better understand the Klein-Gordon equation, let us consider in 
more detail what we mean by “relativity”. Special relativity tells us that 
physics looks the same in all inertial frames. To explain what this means, we 
first suppose that a certain spacetime coordinate system (ct, x) represents 
(by fiat) an inertial frame. Let us define x° = ct, and write x where 
p = 0, 1, 2, 3, in place of (ct, x). It is also convenient (for reasons not at all 
obvious at this point) to define xq = — x° and Xi = x l , where i = 1,2,3. 
This can be expressed more elegantly if we first introduce the Minkowski 
metric, 


9n v 


r +l 


v 


+i 



(1.8) 


where blank entries are zero. We then have x ^ = g^ w x u , where a repeated 
index is summed. 
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To invert this formula, we introduce the inverse of g, which is confusingly 
also called g, except with both indices up: 

(~ l \ 

(1.9) 

\ + 1 / 

We then have g^ v g up = <5 M p , where 5^ p is the Kronecker delta (equal to one 
if its two indices take on the same value, zero otherwise). Now we can also 
write x ^ = g ,ll> x u . 

It is a general rule that any pair of repeated (and therefore summed) 
indices must consist of one superscript and one subscript; these indices are 
said to be contracted. Also, any unrepeated (and therefore unsummed) 
indices must match (in both name and height) on the left- and right-hand 
sides of any valid equation. 

Now we are ready to specify what we mean by an inertial frame. If the 
coordinates x ^ represent an inertial frame (which they do, by assumption), 
then so do any other coordinates that are related by 

= A lx u x l/ + a lx , (1.10) 


<r = 


+i 


+i 


where A^ u is a Lorentz transformation matrix and is a translation vector. 
Both A^ v and a ^ are constant (that is, independent of x M ). Furthermore, 
A! x v must obey 

g pv A\A\ = g pa . (1.11) 

Eq. (1.11) ensures that the interval between two different spacetime points 
that are labeled by and x'^ in one inertial frame, and by and x ,fl in 
another, is the same. This interval is defined to be 

(■ x - x') 2 = g pu (x - x'Y(x - x'Y 

= (x-x') 2 -c 2 {t-t') 2 . (1.12) 

In the other frame, we have 

(x - x') 2 = g^(x - x'Y(x - x'Y 

= g P vA^ p A v a {x - x'Y(x - x'Y 
= 9 p Yx - x'Y(x - x'Y 

= {x — x') 2 , (1.13) 


as desired. 

When we say that physics looks the same , we mean that two observers 
(Alice and Bob, say) using two different sets of coordinates (representing 
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two different inertial frames) should agree on the predicted results of all 
possible experiments. In the case of quantum mechanics, this requires Alice 
and Bob to agree on the value of the wave function at a particular spacetime 
point, a point that is called x by Alice and x by Bob. Thus if Alice’s 
predicted wave function is ip(x), and Bob’s is V’(x), then we should have 
ip(x) = i[)(x). Furthermore, in order to maintain il>(x) = ^(x) throughout 
spacetime, ii’( x ) and t/5( x ) should obey identical equations of motion. Thus 
a candidate wave equation should take the same form in any inertial frame. 

Let us see if this is true of the Klein-Gordon equation. We first introduce 
some useful notation for spacetime derivatives: 


o = JL 

,l dx v 


d ** = 


<) 

dx. 


Note that 


1 d „ \ 

+ c*’ V )’ 

(1.14) 

i d \ 

(1.15) 

g pu , 

(1.16) 


so that our matching-index-height rule is satisfied. 

If x and x are related by eq. (1.10), then d and d are related by 


9" = A » v d v . 


(1.17) 


To check this, we note that 

d p x a = (AV M )(A tT v x v + o'*) = A\A\(d' i x v ) = ° v <r = g pa , 

(1.18) 

as expected. The last equality in eq. (1.18) is another form of eq. (1.11); see 
section 2. 

We can now write eq. (1.7) as 

—h 2 c 2 dQ^(x) = (—h 2 c 2 V 2 + m 2 c 4 )'i/’(x) . (1-19) 

After rearranging and identifying d 2 = d^d^ = — d$ + V 2 , we have 

(— d 2 + m 2 c 2 /fi 2 )il;(x) = 0 . (1-20) 

This is Alice’s form of the equation. Bob would write 

(— d 2 + m 2 c 2 /h 2 )$(x ) = 0 . (1-21) 

Is Bob’s equation equivalent to Alice’s equation? To see that it is, we set 
V>(x) = tp(x), and note that 


d 2 = g^fy fr = g^pA^dPd* = d 2 . 


( 1 . 22 ) 
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Thus, eq. (1.21) is indeed equivalent to eq. (1.20). The Klein-Gordon equa- 
tion is therefore manifestly consistent with relativity: it takes the same 
form in every inertial frame. 

This is the good news. The bad news is that the Klein-Gordon equation 
violates one of the axioms of quantum mechanics: eq. (1.1), the Schrodinger 
equation in its abstract form. The abstract Schrodinger equation has the 
fundamental property of being first order in the time derivative, whereas the 
Klein-Gordon equation is second order. This may not seem too important, 
but in fact it has drastic consequences. One of these is that the norm of a 
state, 

(ip,t\ip,t) = J d 3 x (V’, i|x)(x|i/>, t) = j d 3 x ?/>* (x)'ip(x), (1-23) 

is not in general time independent. Thus probability is not conserved. The 
Klein-Gordon equation obeys relativity, but not quantum mechanics. 

Dirac attempted to solve this problem (for spin-one-half particles) by 
introducing an extra discrete label on the wave function, to account for 
spin: ip a {x), a = 1,2. He then tried a Schrodinger equation of the form 

d 

ih-^> a {x) = (- ihc(a j )abdj + mc 2 ((3) a b)A{x) , (1-24) 

where all repeated indices are summed, and a 3 and (3 are matrices in spin- 
space. This equation, the Dirac equation , is consistent with the abstract 
Schrodinger equation. The state \ip, a, t) carries a spin label a, and the 
hamiltonian is 

H ab = cPj(a j )ab + rn,c 2 ([3)ab , (1-25) 

where Pj is a component of the momentum operator. 

Since the Dirac equation is linear in both time and space derivatives, 
it has a chance to be consistent with relativity. Note that squaring the 
hamiltonian yields 

{H 2 )ab = c 2 PjP k {oP a k ) ab + m,c 3 Pj(a J (3 + (3ot 3 ) ab + {me 2 ) 2 ((3 2 ) ab . (1.26) 

Since PjPk is symmetric on exchange of j and k, we can replace a 3 a k by 
its symmetric part, |{a4, a k }, where {A, B} = AB + BA is the anticom- 
mutator. Then, if we choose matrices such that 

{a 3 , a k } ab = 25 jk 5 ab , {a 3 , (3} ab = 0 , ((3 2 ) a b = Kb , (1-27) 

we will get 

{H 2 )ab = {P 2 c 2 + m 2 c*)6 ab . (1.28) 

Thus, the eigenstates of H 2 are momentum eigenstates, with H 2 eigenvalue 
p 2 c 2 + m 2 c 4 . This is, of course, the correct relativistic energy-momentum 
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relation. While it is outside the scope of this section to demonstrate it, it 
turns out that the Dirac equation is fully consistent with relativity provided 
the Dirac matrices obey eq. (1.27). So we have apparently succeeded in 
constructing a quantum mechanical, relativistic theory! 

There are, however, some problems. We would like the Dirac matrices 
to be 2 x 2, in order to account for electron spin. However, they must 
in fact be larger. To see this, note that the 2x2 Pauli matrices obey 
{a l ,a J } = 2 5 lJ , and are thus candidates for the Dirac a 1 matrices. However, 
there is no fourth matrix that anticommutes with these three (easily proven 
by writing down the most general 2x2 matrix and working out the three 
anticommutators explicitly). Also, we can show that the Dirac matrices 
must be even dimensional; see problem 1.1. Thus their minimum size is 
4x4, and it remains for us to interpret the two extra possible “spin” states. 

However, these extra states cause a more severe problem than a mere 
overcounting. Acting on a momentum eigenstate, H becomes the matrix 
cap + ?nc 2 /3. In problem 1.1, we find that the trace of this matrix is zero. 
Thus the four eigenvalues must be -KE'(p), +I?(p), — i?(p), —E{ p), where 
E(p) = +(p 2 c 2 + m 2 c 4 ) 1//2 . The negative eigenvalues are the problem: 
they indicate that there is no ground state. In a more elaborate theory 
that included interactions with photons, there seems to be no reason why 
a positive energy electron could not emit a photon and drop down into 
a negative energy state. This downward cascade could continue forever. 
(The same problem also arises in attempts to interpret the Klein-Gordon 
equation as a modified form of quantum mechanics.) 

Dirac made a wildly brilliant attempt to fix this problem of negative 
energy states. His solution is based on an empirical fact about electrons: 
they obey the Pauli exclusion principle. It is impossible to put more than 
one of them in the same quantum state. What if, Dirac speculated, all 
the negative energy states were already occupied ? In this case, a positive 
energy electron could not drop into one of these states, by Pauli exclusion. 

Many questions immediately arise. Why don’t we see the negative elec- 
tric charge of this Dirac sea of electrons? Dirac’s answer: because we’re 
used to it. (More precisely, the physical effects of a uniform charge density 
depend on the boundary conditions at infinity that we impose on Maxwell’s 
equations, and there is a choice that renders such a uniform charge density 
invisible.) However, Dirac noted, if one of these negative energy electrons 
were excited into a positive energy state (by, say, a sufficiently energetic 
photon), it would leave behind a hole in the sea of negative energy elec- 
trons. This hole would appear to have positive charge, and positive energy. 
Dirac therefore predicted (in 1927) the existence of the positron , a particle 
with the same mass as the electron, but opposite charge. The positron was 
found experimentally five years later. 
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However, we have now jumped from an attempt at a quantum descrip- 
tion of a single relativistic particle to a theory that apparently requires 
an infinite number of particles. Even if we accept this, we still have not 
solved the problem of how to describe particles like photons or pions or 
alpha nuclei that do not obey Pauli exclusion. 

At this point, it is worthwhile to stop and reflect on why it has proven 
to be so hard to find an acceptable relativistic wave equation for a sin- 
gle quantum particle. Perhaps there is something wrong with our basic 
approach. 

And there is. Recall the axiom of quantum mechanics that says that 
“Observables are represented by hermitian operators.” This is not entirely 
true. There is one observable in quantum mechanics that is not represented 
by a hermitian operator: time. Time enters into quantum mechanics only 
when we announce that the “state of the system” depends on an extra 
parameter t. This parameter is not the eigenvalue of any operator. This is 
in sharp contrast to the particle’s position x, which is the eigenvalue of an 
operator. Thus, space and time are treated very differently, a fact that is 
obscured by writing the Schrodinger equation in terms of the position-space 
wave function ^>(x, t). Since space and time are treated asymmetrically, it 
is not surprising that we are having trouble incorporating a symmetry that 
mixes them up. 

So, what are we to do? 

In principle, the problem could be an intractable one: it might be im- 
possible to combine quantum mechanics and relativity. In this case, there 
would have to be some meta-theory, one that reduces in the nonrelativistic 
limit to quantum mechanics, and in the classical limit to relativistic particle 
dynamics, but is actually neither. 

This, however, turns out not to be the case. We can solve our problem, 
but we must put space and time on an equal footing at the outset. There 
are two ways to do this. One is to demote position from its status as an 
operator, and render it as an extra label, like time. The other is to promote 
time to an operator. 

Let us discuss the second option first. If time becomes an operator, what 
do we use as the time parameter in the Schrodinger equation? Happily, in 
relativistic theories, there is more than one notion of time. We can use the 
proper time r of the particle (the time measured by a clock that moves 
with it) as the time parameter. The coordinate time T (the time measured 
by a stationary clock in an inertial frame) is then promoted to an operator. 
In the Heisenberg picture (where the state of the system is fixed, but the 
operators are functions of time that obey the classical equations of motion), 
we would have operators AT(r), where X° = T. Relativistic quantum 
mechanics can indeed be developed along these lines, but it is surprisingly 
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complicated to do so. (The many times are the problem; any monotonic 
function of r is just as good a candidate as r itself for the proper time, and 
this infinite redundancy of descriptions must be understood and accounted 
for.) 

One of the advantages of considering different formalisms is that they 
may suggest different directions for generalizations. For example, once 
we have AT(r), why not consider adding some more parameters? Then 
we would have, for example, X^(a, r). Classically, this would give us a 
continuous family of worldlines, what we might call a worldsheet , and so 
(ct, r) would describe a propagating string. This is indeed the starting 
point for string theory. 

Thus, promoting time to an operator is a viable option, but is compli- 
cated in practice. Let us then turn to the other option, demoting position 
to a label. The first question is, label on what? The answer is, on oper- 
ators. Thus, consider assigning an operator to each point x in space; call 
these operators <^(x). A set of operators like this is called a quantum field. 
In the Heisenberg picture, the operators are also time dependent: 

¥>(x, t) = e im/7 V(x, 0)e~ im/h . (1.29) 


Thus, both position and (in the Heisenberg picture) time are now labels on 
operators; neither is itself the eigenvalue of an operator. 

So, now we have two different approaches to relativistic quantum theory, 
approaches that might, in principle, yield different results. This, however, 
is not the case: it turns out that any relativistic quantum physics that can 
be treated in one formalism can also be treated in the other. Which we 
use is a matter of convenience and taste. And, quantum field theory , the 
formalism in which position and time are both labels on operators, is much 
more convenient and efficient for most problems. 

There is another useful equivalence: ordinary nonrelativistic quantum 
mechanics, for a fixed number of particles, can be rewritten as a quantum 
field theory. This is an informative exercise, since the corresponding physics 
is already familiar. Let us carry it out. 

Begin with the position-basis Schrodinger equation for n particles, all 
with the same mass rn. moving in an external potential U (x), and inter- 
acting with each other via an interparticle potential H(xi — X 2 ): 


= 


n / t 2 \ n j - 1 

E + u (*i) +EE y ( x w x *) 

. j= 1 v / j= 1 fc=i 




(1.30) 


where • • • 1 x n ; t) is the position-space wave function. The quan- 

tum mechanics of this system can be rewritten in the abstract form of 
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eq. (1.1) by first introducing (in, for now, the Schrodinger picture) a quan- 
tum field a(x) and its hermitian conjugate a^(x). We take these operators 
to have the commutation relations 

[a(x), a(x / )] = 0 , 

[a t (x),a t (x')] = 0 , 

[a(x), a^x')] = <5 3 (x - x') , (1.31) 

where h 3 (x) is the three-dimensional Dirac delta function. Thus, a^(x) and 
a(x) behave like harmonic-oscillator creation and annihilation operators 
that are labeled by a continuous index. In terms of them, we introduce the 
hamiltonian operator of our quantum field theory, 

H = J d 3 x a t(x)(-|iv 2 + £7(x))a(x) 

+ \ J Ad 3 i/h(x-y)a t (x)fl t (y)fl(y)a(x). (1.32) 

Now consider a time-dependent state of the form 

\ip,t) = /d 3 xi...d 3 x^(xi,...,x n ;t)at(xi)...at(x„)|0), (1.33) 

where VKxij • ■ ■ ,x n ; t) is some function of the n particle positions and time, 

and |0) is the vacuum state, the state that is annihilated by all the a’s, 

a(x)|0) = 0 . (1.34) 

It is now straightforward (though tedious) to verify that eq. (1.1), the ab- 
stract Schrodinger equation, is obeyed if and only if the function •*/> satisfies 
eq. (1.30). 

Thus we can interpret the state |0) as a state of “no particles”, the state 
a^(xi)|0) as a state with one particle at position xi, the state a' (xi)Gd(x 2 )| 0 ) 
as a state with one particle at position xi and another at position X 2 , and 
so on. The operator 

N = j d 3 x a^(x)a(x) (1.35) 

counts the total number of particles. It commutes with the hamiltonian, 
as is easily checked; thus, if we start with a state of n particles, we remain 
with a state of n particles at all times. 

However, we can imagine generalizations of this version of the theory 
(generalizations that would not be possible without the field formalism) in 
which the number of particles is not conserved. For example, we could try 
adding to H a term like 

AH ex J d 3 x a^(x)a 2 (x) + h.c. . 


(1.36) 
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This term does not commute with N, and so the number of particles would 
not be conserved with this addition to H. 

Theories in which the number of particles can change as time evolves are 
a good thing: they are needed for correct phenomenology. We are already 
familiar with the notion that atoms can emit and absorb photons, and so 
we had better have a formalism that can incorporate this phenomenon. We 
are less familiar with emission and absorption (that is to say, creation and 
annihilation) of electrons, but this process also occurs in nature; it is less 
common because it must be accompanied by the emission or absorption of 
a positron, antiparticle to the electron. There are not a lot of positrons 
around to facilitate electron annihilation, while e + e _ pair creation requires 
us to have on hand at least 2 me 2 of energy available for the rest-mass 
energy of these two particles. The photon, on the other hand, is its own 
antiparticle, and has zero rest mass; thus photons are easily and copiously 
produced and destroyed. 

There is another important aspect of the quantum theory specified by 
eqs. (1-32) and (1.33). Because the creation operators commute with each 
other, only the completely symmetric part of if survives the integration 
in eq. (1.33). Therefore, without loss of generality, we can restrict our 
attention to ip's of this type: 

if (- . . Xj . . . Xj . . . ;t) = +if(. ■ • Xj . . . Xj . . . ; t) . (1-37) 

This means that we have a theory of bosons, particles that (like photons or 
pions or alpha nuclei) obey Bose-Einstein statistics. If we want Fermi-Dirac 
statistics instead, we must replace eq. (1.31) with 

{a(x),a(x')} = 0 , 

{a t (x),a t (x / )} = 0 , 

{a(x), at(x 7 )} = <5 3 (x — x 7 ) , (1.38) 

where again {A, B} = AB + BA is the anticommutator. Now only the fully 
antisymmetric part of if survives the integration in eq. (1.33), and so we 
can restrict our attention to 

Y>(. . . Xj . . . xj . . . ; t) = -if (. . . Xj . . . Xj . . . ; t) . (1.39) 

Thus we have a theory of fermions. It is straightforward to check that 
the abstract Schrodinger equation, eq. (1.1), still implies that if obeys the 
differential equation (1.30). 1 Interestingly, there is no simple way to write 

1 Now, however, the ordering of the a and af operators in the last term of eq. (1.32) 

becomes significant, and must be as written. 
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down a quantum field theory with particles that obey Boltzmann statistics, 
corresponding to a wave function with no particular symmetry. This is a 
hint of the spin- statistics theorem, which applies to relativistic quantum 
field theory. It says that interacting particles with integer spin must be 
bosons, and interacting particles with half-integer spin must be fermions. 
In our nonrelativistic example, the interacting particles clearly have spin 
zero (because their creation operators carry no labels that could be inter- 
preted as corresponding to different spin states), but can be either bosons 
or fermions, as we have seen. 

Now that we have seen how to rewrite the nonrelativistic quantum me- 
chanics of multiple bosons or fermions as a quantum field theory, it is time 
to try to construct a relativistic version. 

Reference Notes 

The history of the physics of elementary particles is recounted in Pais. A 
brief overview can be found in Weinberg I. More details on quantum field 
theory for nonrelativistic particles can be found in Brown. 

Problems 

1.1) Show that the Dirac matrices must be even dimensional. Hint: show 
that the eigenvalues of (3 are all ±1, and that TV/3 = 0. To show that 
Tr (3 = 0, consider, e.g., TV a\(3. Similarly, show that Tra* = 0. 

1.2) With the hamiltonian of eq. (1.32), show that the state defined in 
eq. (1.33) obeys the abstract Schrodinger equation, eq. (1.1), if and 
only if the wave function obeys eq. (1.30). Your demonstration should 
apply both to the case of bosons, where the particle creation and anni- 
hilation operators obey the commutation relations of eq. (1.31), and 
to fermions, where the particle creation and annihilation operators 
obey the anticommutation relations of eq. (1.38). 

1.3) Show explicitly that [N, H] = 0, where H is given by eq. (1.32) and 
N by eq. (1.35). 



2 Lorentz Invariance 

Prerequisite: 1 


A Lorentz transformation is a linear, homogeneous change of coordinates 
from x p to x p , 

x p = A p v x v , (2.1) 

that preserves the interval x 2 between x p and the origin, where 

x 2 = x p x p = g pA /x p jA = x 2 - c 2 t 2 . (2.2) 

This means that the matrix must obey 

g pv A p p A v a = g pa , (2.3) 

where 

-1 

+1 

+1 

+1 

is the Minkowski metric. 

Note that this set of transformations includes ordinary spatial rotations: 
take A°o = 1, A°j = A*o = 0, and A l j = Rjj , where R is an orthogonal 
rotation matrix. 

The set of all Lorentz transformations forms a group : the product of 
any two Lorentz transformations is another Lorentz transformation; the 
product is associative; there is an identity transformation, A p „ = 5 ll u ] 
and every Lorentz transformation has an inverse. It is easy to demonstrate 
these statements explicitly. For example, to find the inverse transformation 
(A -1 )^,,, note that the left-hand side of eq. (2.3) can be written as A up A u a , 
and that we can raise the p index on both sides to get A u p A u a = 5 p a . On 
the other hand, by definition, (A~ l ) p U A U a = 5 p a . Therefore 

(A-y, = A/ . (2.5) 

Another useful version of eq. (2.3) is 

gT A%A\ = g p ° . (2.6) 

To get eq. (2.6), start with eq. (2.3), but with the inverse transformations 
(A~y p and (A” 1 )^. Then use eq. (2.5), raise all down indices, and lower 
all up indices. The result is eq. (2.6). 

For an infinitesimal Lorentz transformation, we can write 




A p v = 5 P V + 5u\ . 


(2.7) 
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Eq. (2.3) can be used to show that 5u with both indices down (or up) is 
antisymmetric: 

5oJp(j — 5tU,jp • ( 2 . 8 ) 

Thus there are six independent infinitesimal Lorentz transformations (in 
four spacetime dimensions). These can be divided into three rotations 
(Suij = — EijkhkSd for a rotation by angle 56 about the unit vector n) and 
three boosts (Scjio = niSrj for a boost in the direction n by rapidity 5rj). 

Not all Lorentz transformations can be reached by compounding in- 
finitesimal ones. If we take the determinant of eq. (2.5), we get (det A) -1 = 
detA, which implies det A = ±1. Transformations with det A = +1 are 
proper, and transformations with detA = — 1 are improper. Note that the 
product of any two proper Lorentz transformations is proper, and that 
infinitesimal transformations of the form A = 1 + 5lo are proper. There- 
fore, any transformation that can be reached by compounding infinitesimal 
ones is proper. The proper transformations form a subgroup of the Lorentz 
group. 

Another subgroup is that of the orthochronous Lorentz transformations: 
those for which A°o > +1. Note that eq. (2.3) implies (A°o) 2 — A*oA*o = 1; 
thus, either A°o > +1 or A°o < —1. An infinitesimal transformation is 
clearly orthochronous, and it is straightforward to show that the product 
of two orthochronous transformations is also orthochronous. 

Thus, the Lorentz transformations that can be reached by compounding 
infinitesimal ones are both proper and orthochronous, and they form a 
subgroup. We can introduce two discrete transformations that take us out 
of this subgroup: parity and time reversal. The parity transformation is 


P^u = {V-'Yv 


( +1 \ 

-1 

-1 

V -i / 


(2.9) 


It is orthochronous, but improper. The time-reversal transformation is 


= (T~ l y v 


( - 1 
V 


\ 

+i 

+i 

+i/ 


( 2 . 10 ) 


It is nonorthochronous and improper. 

Generally, when a theory is said to be Lorentz invariant , this means 
under the proper orthochronous subgroup only. Parity and time reversal 
are treated separately. It is possible for a quantum field theory to be 
invariant under the proper orthochronous subgroup, but not under parity 
and/or time-reversal. 
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From here on, in this section, we will treat the proper orthochronous 
subgroup only. Parity and time reversal will be treated in section 23. 

In quantum theory, symmetries are represented by unitary (or antiu- 
nitary) operators. This means that we associate a unitary operator 17(A) 
to each proper, orthochronous Lorentz transformation A. These operators 
must obey the composition rule 

U( A' A) = U(A')U{A) . (2.11) 

For an infinitesimal transformation, we can write 

U(l+6u>) = 1+ , (2.12) 

where M IW = —M vp is a set of hermitian operators called the generators 
of the Lorentz group. If we start with U(A)~ 1 U (A')U (A) = 17(A~ 1 A / A), let 
A' = 1 + 5io' , and expand both sides to linear order in Sco, we get 

5uj f _ lu U(A)~ 1 M pv U (A) = 5eo^A%A\M pa . (2.13) 

Then, since is arbitrary (except for being antisymmetric), the anti- 
symmetric part of its coefficient on each side must be the same. In this 
case, because M IW is already antisymmetric (by definition), we have 

U(A)~ 1 M pu U(A) = A p p A u a M pa . (2.14) 

We see that each vector index on M IW undergoes its own Lorentz trans- 
formation. This is a general result: any operator carrying one or more 
vector indices should behave similarly. For example, consider the energy- 
momentum four- vector P p , where P° is the hamiltonian H and P' are the 
components of the total three-momentum operator. We expect 

U(A)~ l P p U{A) = A p v P v . (2.15) 

If we now let A = 1 + 5lo in eq. (2.14), expand to linear order in 5lu, 
and equate the antisymmetric part of the coefficients of Su^, we get the 
commutation relations 

[A'P w ,M pcT } = ih(g pp M u,y - (/x<-n/)) - (pv^a) . (2.16) 

These commutation relations specify the Lie algebra of the Lorentz group. 
We can identify the components of the angular momentum operator J as 
Jj = and the components of the boost operator K as Kj = M i0 . 

We then find from eq. (2.16) that 

[A , 'I j\ — ihSijkJk i 
•7 , Kj] — ittSijkLs-k , 

[Ki, Kj] — iTiSij }■.]}■ 


(2.17) 
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The first of these is the usual set of commutators for angular momentum, 
and the second says that K transforms as a three- vector under rotations. 
The third implies that a series of boosts can be equivalent to a rotation. 
Similarly, we can let A = 1 + 8u in eq. (2.15) to get 

[P p , M p<r ] = ih(g»°pp _ , (2.18) 

which becomes 


= 0 , 

Pj\ — ih^ijkPk j 

[Ki,H] = ihPi , 

[Ki,Pj] = i%5ijH , (2.19) 

Also, the components of P ^ should commute with each other: 

[P-, Pj] = 0 , 

[Pi,H] = 0. (2.20) 

Together, eqs. (2.17), (2.19), and (2.20) form the Lie algebra of the Poincare 
group. 

Let us now consider what should happen to a quantum scalar field ip{x) 
under a Lorentz transformation. We begin by recalling how time evolution 
works in the Heisenberg picture: 


e + ifff / R ^(x, 0)e~ iHt/h = <p(x,t) . 

(2.21) 

Obviously, this should have a relativistic generalization, 


e - iPx l % ip{ 0)e +iPx / n = <p{x) , 

(2.22) 

where Px = P^x^ = P • x — Hct. We can make this a 
defining the unitary spacetime translation operator 

little fancier by 

T(a ) = exp(— iP^a^/Ti) . 

(2.23) 

Then we have 

T(a)~ 1 ip(x)T(a) = tp(x — a) . 

(2.24) 

For an infinitesimal translation, 


T(Sa) =1- jrS a , . 

(2.25) 


Comparing eqs. (2.12) and (2.25), we see that eq. (2.24) leads us to expect 
t/(A)~V(^)t / (A) = 9?(A -1 x) . (2.26) 
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Derivatives of ip then carry vector indices that transform in the appropriate 
way, e.g., 

U(A)~ 1 d p- ip(x)U(A) = A^pdP^A^x) , (2.27) 

where the bar on a derivative means that it is with respect to the argument 
x = A~ l x. Eq. (2.27) also implies 

U(A)~ 1 d 2 ip(x)U(A) = d 2 ip(A~ 1 x) , (2.28) 

so that the Klein-Gordon equation, (— d 2 + m 2 /fi 2 c 2 )<p = 0, is Lorentz 
invariant, as we saw in section 1. 

Reference Notes 

A detailed discussion of quantum Lorentz transformations can be found in 
Weinberg I. 

Problems 

2.1) Verify that eq. (2.8) follows from eq. (2.3). 

2.2) Verify that eq. (2.14) follows from U(A)~ 1 U(A')U(A) = U(A~ 1 A'A). 

2.3) Verify that eq. (2.16) follows from eq. (2.14). 

2.4) Verify that eq. (2.17) follows from eq. (2.16). 

2.5) Verify that eq. (2.18) follows from eq. (2.15). 

2.6) Verify that eq. (2.19) follows from eq. (2.18). 

2.7) What property should be attributed to the translation operator T(a) 
that could be used to prove eq. (2.20)? 

2.8) a) Let A = 1 + Sin in eq. (2.26), and show that 

b(x),MH=£'>(x), (2.29) 

where 

CJ W = \{x p d v - x v d p ) . (2.30) 

b) Show that [[<p(x), M^],M pa ] = 0* tp(x) . 

c) Prove the Jacobi identity , [[A, B],C] + [[B, C], A] + [[(7, A], B\ = 0. 
Hint: write out all the commutators. 

d) Use your results from parts (b) and (c) to show that 

[<p(x), [M^,M pcT ]} = (C pv C pa - C pa jC pu )(p(x) . 


(2.31) 
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e) Simplify the right-hand side of eq. (2.31) as much as possible. 

f) Use your results from part (e) to verify eq. (2.16), up to the possi- 
bility of a term on the right-hand side that commutes with <p(x ) and 
its derivatives. (Such a term, called a central charge , in fact does not 
arise for the Lorentz algebra.) 

2.9) Let us write 

A p t = S p T + ^uv(SvTr , (2.32) 

where 

(SD P r = 7(/Vr-m (2.33) 

are matrices which constitute the vector representation of the Lorentz 
generators. 

a) Let A = 1 + 5lo in eq. (2.27), and show that 

[d p y{x), A/H = C pv d p ip{x) + (Sn p r3>(x) • (2.34) 


b) Show that the matrices S pu must have the same commutation 
relations as the operators M pv . Hint: see the previous problem. 

c) For a rotation by an angle 9 about the z axis, we have 


/I 0 0 0\ 

0 cos 9 — sin 9 0 

0 sin 9 cos 9 0 

\0 0 01 / 


Show that 

A = exp(— i9Sy 2 /h) . 


(2.35) 


(2.36) 


d) For a boost by rapidity r) in the z direction, we have 


A^ = 


( cosh 7] 
0 
0 

V sinh r/ 


0 

1 

0 

0 


0 

0 

1 

0 


sinh i] \ 
0 
0 

cosh 7 ] ) 


(2.37) 


Show that 


A = exp(+ir)Sv° /h) ■ 


(2.38) 
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3 Canonical Quantization of Scalar Fields 

Prerequisite: 2 

Let us go back and drastically simplify the hamiltonian we constructed in 
section 1, reducing it to the hamiltonian for free particles: 

H = f d 3 x a t (x)(— 2 ^V 2 )a(x) 

= f d3 P 2^iP 2 st (p)«(p) > (3- 1 ) 

where 

S(p) = / (2^72 e "‘ P X “ (X) ' (3 2) 

Here we have simplified our notation by setting 

Ti = 1 . (3.3) 

The appropriate factors of fi can always be restored in any of our formulas 
via dimensional analysis. The commutation (or anticommutation) relations 
of the a(p) and <F(p) operators are 

[«(p),S(p')] =F = 0 , 

[a t (p),a t (p')] =F = 0 , 

[a(p),a t (p')] =F = <5 3 (p - p') , (3.4) 

where [A, B]^ is either the commutator (if we want a theory of bosons) 
or the anticommutator (if we want a theory of fermions). Thus a'l’(p) can 
be interpreted as creating a state of definite momentum p, and eq. (3.1) 
describes a theory of free particles. The ground state is the vacuum |0); it 
is annihilated by a(p), 

5(p)|0) = 0 , (3.5) 

and so its energy eigenvalue is zero. The other eigenstates of H are all of 
the form o^pi) . . . o' (p n )|0), and the corresponding energy eigenvalue is 
E( Pi) + • • • + E{p n ), where E{ p) = ^p 2 . 

It is easy to see how to generalize this theory to a relativistic one; all we 
need to do is use the relativistic energy formula E( p) = +(p 2 c 2 + m 2 c 4 ) 1 / 2 : 

H = J d 3 p (p 2 c 2 + m 2 c 4 ) 1//2 cQp)a(p) . (3.6) 

Now we have a theory of free relativistic spin-zero particles, and they can 
be either bosons or fermions. 
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Is this theory really Lorentz invariant? We will answer this question (in 
the affirmative) in a very roundabout way: by constructing it again, from 
a rather different point of view, a point of view that emphasizes Lorentz 
invariance from the beginning. 

We will start with the classical physics of a real scalar field (p(x). Real 
means that f(x) assigns a real number to every point in spacetime. Scalar 
means that Alice [who uses coordinates and calls the field ty{x)\ and Bob 
[who uses coordinates x M , related to Alice’s coordinates by x M = A fl v x l '+a 1 ' , 
and calls the field (p(x)\, agree on the numerical value of the field: <p(x) = 
<p(x). This then implies that the equation of motion for tp(x) must be the 
same as that for <p(x). We have already met an equation of this type: the 
Klein-Gordon equation, 


(— d 2 + m 2 )<p(x) = 0 . (3.7) 

Here we have simplified our notation by setting 

c = 1 (3.8) 

in addition to h = 1. As with h, factors of c can restored, if desired, by 
dimensional analysis. 

We will adopt eq. (3.7) as the equation of motion that we would like 
<p(x) to obey. It should be emphasized at this point that we are doing 
classical physics of a real scalar field. We are not to think of tp(x) as a 
quantum wave function. Thus, there should not be any factors of h in this 
version of the Klein-Gordon equation. This means that the parameter m 
must have dimensions of inverse length; m is not (yet) to be thought of as 
a mass. 

The equation of motion can be derived from variation of an action 
S = J dtL, where L is the lagrangian. Since the Klein-Gordon equation is 
local, we expect that the lagrangian can be written as the space integral of 
a lagrangian density C\ L = / d 3 x£. Thus, S = / d 4 x£. The integration 
measure d 4 x is Lorentz invariant: if we change to coordinates x ^ = A^x 1 ', 
we have d A x = | det A| d 4 x = d A x. Thus, for the action to be Lorentz in- 
variant, the lagrangian density must be a Lorentz scalar: £(x) = C{x). 
Then we have S = f d 4 x C{ x) = / d 4 x £(x) = S. Any simple function of 
<p is a Lorentz scalar, and so are products of derivatives with all indices 
contracted, such as d^ipd^ip. We will take for C 

C = — \m 2 <p 2 + , (3.9) 

where Ho is an arbitrary constant. We find the equation motion (also known 
as the Euler- Lagrange equation ) by making an infinitesimal variation 5ip(x) 
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in p(x), and requiring the corresponding variation of the action to vanish: 

0 = ss 

= J d A x — ^d^dpd^p — ^d^pd^dp — m 2 pdp 

= J d A x Fd^d^ip — m 2 p dip. (3.10) 

In the last line, we have integrated by parts in each of the first two terms, 
putting both derivatives on p. We assume dp(x) vanishes at infinity in 
any direction (spatial or temporal), so that there is no surface term. Since 
5p has an arbitrary x dependence, eq. (3.10) can be true if and only if 
(— d 2 + m 2 )p = 0. 

One solution of the Klein-Gordon equation is a plane wave of the form 
exp(zk-x ± iut), where k is an arbitrary real wave- vector, and 

oj = +(k 2 + m 2 ) 1 / 2 . (3.11) 

The general solution (assuming boundary conditions that require p to re- 
main finite at spatial infinity) is then 

r d 3 k r n 

p(x.,t)= / -jjyr a{k)e ik *- iujt + b{k)e ik x+iu}t , (3.12) 

J f(k) L J 

where a(k) and 6(k) are arbitrary functions of the wave vector k, and f(k) 
is a redundant function of the magnitude of k which we have inserted for 
later convenience. Note that, if we were attempting to interpret p(x) as 
a quantum wave function (which we most definitely are not), then the 
second term would constitute the “negative energy” contributions to the 
wave function. This is because a plane-wave solution of the nonrelativistic 
Schrodinger equation for a single particle looks like exp(?'p • x — iE(p)t), 
with i?(p) = 2^7 P 2 ; there is a minus sign in front of the positive energy. We 
are trying to interpret eq. (3.12) as a real classical field, but this formula 
does not generically result in p being real. We must impose p*(x) = p{x ) , 
where 

¥>*(x,t) = J [a*(k)e-* k ' x+iu,t + 6*(k)e- <k ' x - <wt 

r d 3 k r 1 

= —— a *(k)e~ ik * +iujt + 6*(— k)e+* kx -^ . (3.13) 

J f{k) L 

In the second term on the second line, we have changed the dummy inte- 
gration variable from k to — k. Comparing eqs. (3.12) and (3.13), we see 
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that = <p(x) requires b*(—k) = a(k). Imposing this condition, we 

can rewrite <p as 


<p(x,t) 



o( k)e ikx ~ iuJt + o*(- k)e ikx+iuJt 



a(k)e ikx ~ iut + a* (k)e~ ik ' x+iut 



a(k)e ikx + a*(k)e~ ikx 


(3.14) 


where kx = k • x — cot is the Lorentz-invariant product of the four- vectors 
x v = (t, x) and k ^ = (w, k): kx = k^x^ = g^k^x” . Note that 

k 2 = k^kfj, = k 2 — oj 2 = — m 2 . (3.15) 

A four-momentum k ^ that obeys k 2 = — m 2 is said to be on the mass shell , 
or on shell for short. 

It is now convenient to choose f(k) so that d?k/ f(k) is Lorentz invariant. 
An integration measure that is manifestly invariant under orthochronous 
Lorentz transformations is d^k 5(k 2 +m 2 ) 0(k°), where 5(x) is the Dirac delta 
function, 0(x) is the unit step function, and k° is treated as an independent 
integration variable. We then have 

/ +0O 1 

dk° 5(k 2 +m 2 ) 0(k°) = — . (3.16) 

-OO 

Here we have used the rule 

/ +0O 1 

-oo dxS(9(x)) = ^WM’ (3 - 17) 


where g(x) is any smooth function of x with simple zeros at x = Xi\ in our 
case, the only zero is at k° = u. 

Thus we see that if we take f(k ) oc u, then d 3 k/f(k ) will be Lorentz 
invariant. We will take f(k) = (27 t) 3 2cu. It is then convenient to give the 
corresponding Lorentz-invariant differential its own name: 


dk = 


d 3 k 

( 2tt) 3 2lo 


(3.18) 


Thus we finally have 

<p(x) = [dk \a(k)e ikx + a*(k)e~ ikx 


(3.19) 
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We can also invert this formula to get a(k) in terms of p(x). We have 


A e-“>(x) = Aa(k) + J-e 2i “‘a*(-k) 


I d?x e- itx So<p( x) = -|a(k) + ie 2 ”‘a*(-k) 
We can combine these to get 

a(k) = J d 3 x e~ lkx idop(x) + uj<p(x) 

= i [ d 3 x e~ ikx d 0 p(x) , 


(3.20) 


(3.21) 


where fd^g = /{d^g) — {d^f)g, and dop = dp/dt = p. Note that a(k) is 
time independent. 

Now that we have the lagrangian, we can construct the hamiltonian by 
the usual rules. Recall that, given a lagrangian L(qi,qi ) as a function of 
some coordinates q* and their time derivatives q t , the conjugate momenta 
are given by pi = dL/dqi, and the hamiltonian by H = J2iPiQi ~ L. In 
our case, the role of q % (t) is played by p(x, t), with x playing the role of a 
(continuous) index. The appropriate generalizations are then 


. dL 

n(^) = - X 

dp{x) 

(3.22) 

n = up- £ , 

(3.23) 


where 74 is the hamiltonian density , and the hamiltonian itself is H = 
J d 3 x Ft. In our case, we have 


n(x) = p(x) (3.24) 

and 

H = \H 2 + \{Vp) 2 + \m 2 p 2 -Flo . ( 3 . 25 ) 

Using eq. ( 3 . 19 ), we can write H in terms of the a(k) and a*(k) coefficients: 

H = — UoU + \ J dk dk d 3 x 

(~iu>a(k)e ikx + iua*(k)e- ikx ) (-iJ a(k!)e ik ' x + iJ a* (k!)e~ ik ' x ) 
+ {^+ika(k)e ikx - ika* {k)e~ ikx ) • (+ik' a( k!)e ik ' x - ik 1 a*( k')e~ ik ' x ) 
+ m 2 (a(k)e ikx + a*( k)e~ ikx ) (a(k')e ik ' x + a* (k')e~ ik ' x J 
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= — + ^(2ir) 3 J dk dk 

d 3 (k — k')(+ww' + k-k ' + ?n 2 ) 
x (a*(k)a(k , )e- i(uJ - u; ' )t + a(k)a*(k / )e +i(a ’- aJ,)t ) 

+ <J 3 (k + k / )(— ioJ — k-k' + m 2 ) 

x (a(k)a(k')e- i(aJ+u; ' )t + a*(k)a*(k')e + ^ +a/) *) 

= -<W + | Jdk±\ 

( +uj 2 + k 2 + m 2 ) (a* (k)a(k) + a(k)a* (k)^ 

+ (-w 2 + k 2 + m 2 )(a(k)a(~k)e- 2 ^ + a*(k)a*(-k)e +2i ^)" 

= —VLqV + ^2 j dk a;^a*(k)a(k) + a(k)a*(k)^ , (3.26) 

where V is the volume of space. To get the second equality, we used 

J d 3 x e iq x = (2vr) 3 5 3 (q) . (3.27) 

To get the third equality, we integrated over k', using dk = d 3 k' / ( 27 t) 3 2 u/ . 
The last equality then follows from uj = (k 2 +m 2 ) 1 / 2 . Also, we were careful 
to keep the ordering of a(k) and a*(k) unchanged throughout, in anticipa- 
tion of passing to the quantum theory where these classical functions will 
become operators that may not commute. 

Let us take up the quantum theory now. We can go from classical 
to quantum mechanics via canonical quantization. This means that we 
promote q t and p t to operators, with commutation relations [(/,;, qj\ = 0, 
\pi,Pj] = 0, and [qi,Pj] = ifiSij. In the Heisenberg picture, these operators 
should be taken at equal times. In our case, where the “index” is continuous 
(and we have set Ti = 1), we have 

[^(x,t),^( x ',t)] = 0 , 

[n(x,f),n(x',t)] = o , 

[</?(x, t), n(x', £)] = id 3 (x — x') . (3.28) 


From these canonical commutation relations, and from eqs. (3.21) and (3.24) 
we can deduce 


[a(k),u(k')] = 0 , 

[o t (k),a t (k')] = 0 , 

[a(k), a^(k')] = ( 27 r) 3 2 u; A 3 (k — k') . 


(3.29) 
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We are now denoting a*(k) as a^(k), since ad(k) is now the hermitian 
conjugate (rather than the complex conjugate) of the operator a(k). We 
can now rewrite the hamiltonian as 

H = Jdkujo 1 (k)o(k) + (So - n 0 )V , (3.30) 

where 

So = J d 3 k co (3.31) 

is the total zero-point energy of all the oscillators per unit volume, and, 
using eq. (3.27), we have interpreted (27r) 3 <5 3 (0) as the volume of space V. 

If we integrate in eq. (3.31) over the whole range of k, the value of So is 
infinite. If we integrate only up to a maximum value of A, a number known 
as the ultraviolet cutoff . , we find 

A 4 

£ » = me • < 3 - 32 > 

where we have assumed A m. This is physically justified if, in the real 
world, the formalism of quantum field theory breaks down at some large 
energy scale. For now, we simply note that the value of Ho is arbitrary, and 
so we are free to choose Ho = So- With this choice, the ground state has 
energy eigenvalue zero. Now, if we like, we can take the limit A — > oo, with 
no further consequences. (We will meet more of these ultraviolet divergences 
after we introduce interactions.) 

The hamiltonian of eq. (3.30) is now the same as that of eq. (3.6), with 
a(k) = [(27^) 3 2^u] 1/,2 a(k). The commutation relations (3.4) and (3.29) are 
also equivalent, if we choose commutators (rather than anticommutators) 
in eq. (3.4). Thus, we have re-derived the hamiltonian of free relativistic 
bosons by quantization of a scalar field whose equation of motion is the 
Klein- Gordon equation. The parameter m in the lagrangian is now seen to 
be the mass of the particle in the quantum theory. (More precisely, since 
m has dimensions of inverse length, the particle mass is hem.) 

What if we want fermions? Then we should use anticommutators in 
eqs. (3.28) and (3.29). There is a problem, though; eq. (3.26) does not then 
become eq. (3.30). Instead, we get H = — QoV, a simple constant. Clearly 
there is something wrong with using anticommutators. This is another hint 
of the spin-statistics theorem, which we will take up in section 4. 

Next, we would like to add Lorentz-invariant interactions to our theory. 
With the formalism we have developed, this is easy to do. Any local func- 
tion of ip(x) is a Lorentz scalar, and so if we add a term like p 3 or ip A to 
the lagrangian density £, the resulting action will still be Lorentz invariant. 
Now, however, we will have interactions among the particles. Our next task 
is to deduce the consequences of these interactions. 
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However, we already have enough tools at our disposal to prove the 
spin-statistics theorem for spin-zero particles, and that is what we turn to 
next. 


Problems 

3.1) Derive eq. (3.29) from eqs. (3.21), (3.24), and (3.28). 

3.2) Use the commutation relations, eq. (3.29), to show explicitly that a 
state of the form 


I Ah • • • K) = a f (ki) . . . a f (k n )|0) (3.33) 

is an eigenstate of the hamiltonian, eq. (3.30), with eigenvalue u\ + 
. . . +lo u . The vacuum |0) is annihilated by a(k), a(k)|0) = 0, and we 
take Ho = So hr eq. (3.30). 

3.3) Use U(A)~ 1 (p(x)U(A) = ip(A~ 1 x) to show that 

£/(A) _1 a(k)U(A) = a(A” 1 k) , 

[/(A)”V(k)[/(A) = a t (A~ 1 k) , (3.34) 

and hence that 

U(A)\ki . . . k n ) = |Afei . . . Ak n ) , (3.35) 

where \k\ . . . k n ) = a' (ki) . . . a^(k„)|0) is a state of n particles with 
momenta k\, ... ,k n . 

3.4) Recall that T(a)~ 1 i^(x)T(a) = <p(x — a), where T{a) = exp(— iP^a^) 
is the spacetime translation operator, and P° is identified as the 
hamiltonian H. 

a) Let a ^ be infinitesimal, and derive an expression for [ip(x), P^]. 

b) Show that the time component of your result is equivalent to the 
Heisenberg equation of motion iip = [tp,H]. 

c) For a free held, use the Heisenberg equation to derive the Klein- 
Gordon equation. 

d) Define a spatial momentum operator 

P = — J d 3 xU(x)V(p(x) . (3.36) 

Use the canonical commutation relations to show that P obeys the 
relation you derived in part (a). 

e) Express P in terms of a(k) and a^(k). 
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3.5) Consider a complex (that is, nonhermitian) scalar field p with la- 
grangian density 


C = —d^p^d^p — m 2 p^p + hlo • (3.37) 


a) Show that (p obeys the Klein-Gordon equation. 

b) Treat tp and p' as independent fields, and find the conjugate mo- 
mentum for each. Compute the hamiltonian density in terms of these 
conjugate momenta and the fields themselves (but not their time 
derivatives). 

c) Write the mode expansion of p as 


p{x) 


= / dk 


a(k)e ikx + 6 f (k)e 


—ikx 


(3.38) 


Express a(k) and 6(k) in terms of p and p 1 and their time derivatives. 

d) Assuming canonical commutation relations for the fields and their 
conjugate momenta, find the commutation relations obeyed by a(k) 
and 6(k) and their hermitian conjugates. 

e) Express the hamiltonian in terms of a(k) and 6(k) and their her- 
mitian conjugates. What value must flo have in order for the ground 
state to have zero energy? 



4: The Spin-Statistics Theorem 


45 


4 The Spin-Statistics Theorem 

Prerequisite: 3 

Let us consider a theory of free, spin-zero particles specified by the hamil- 
tonian 

Hq = J dk lo T(k)a(k) , (4.1) 

where lo = (k 2 + m 2 ) 1//2 , and either the commutation or anticommutation 
relations 

[a(k),a(k')T = 0 , 

[a t (k),a t (k / )] T = 0 , 

[a(k), a^k')]^ = (27r) 3 2cu <5 3 (k — k') . (4.2) 

Of course, if we want a theory of bosons, we should use commutators, and 
if we want fermions, we should use anticommutators. 

Now let us consider adding terms to the hamiltonian that will result in 
local, Lorentz invariant interactions. In order to do this, it is convenient to 
define a nonhermitian field, 

<T“(x, 0) = J dk e* k ' x a(k) , (4.3) 

and its hermitian conjugate 

ip- (x, 0 ) = Jdk e- lk x a\ k) . (4.4) 

These are then time-evolved with Hq: 

p+(x,t) = e^°V(x, 0)e~ iHot = Jdk e ikx o(k) , 

= e iHot <p-(x, 0)e~ iHot = Jdk e~ ikx T(k) . (4.5) 

Note that the usual hermitian free field ip(x ) is just the sum of these: 
<p(x) = y + {x) + <p~(x). 

For a proper orthochronous Lorentz transformation A, we have 

t/(A)~V(x) 17(A) = 99(A _1 .t) . (4.6) 

This implies that the particle creation and annihilation operators transform 
as 


U (A)~ 1 a(k)U (A) = a(A -1 k) , 
t/(A) _1 a t (k)L r (A) = a^A-T) . 


(4.7) 
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This, in turn, implies that <p + (x) and <p~(x) are Lorentz scalars: 

U (A)* 1 (p^ (x)U (A) = ^(A -1 ^) . (4.8) 

We will then have local, Lorentz invariant interactions if we take the in- 
teraction lagrangian density C\ to be a hermitian function of <p + (x) and 
<p~(x). 

To proceed we need to recall some facts about time-dependent pertur- 
bation theory in quantum mechanics. The transition amplitude to 

start with an initial state |i) at time t = — oo and end with a final state |/) 
at time t. = +oo is 

Tf<-i = (/| T exp -i / dt Hj(t) \i) , (4.9) 

J — OO J 

where Hj(t) is the perturbing hamiltonian in the interaction picture, 

Hj(t) = exp(+iHot) Hi exp(— iH$t) , (4-10) 


H i is the perturbing hamiltonian in the Schrodinger picture, and T is the 
time ordering symbol: a product of operators to its right is to be ordered, 
not as written, but with operators at later times to the left of those at earlier 
times. We write H\ = / d 3 x Lti(x, 0), and specify 7fi(x, 0) as a hermitian 
function of (/? + (x, 0) and </? - (x, 0). Then, using eqs. (4.5) and (4.10), we 
can see that, in the interaction picture, the perturbing hamiltonian density 
?f/(x, t) is simply given by the same function of 99 + (x, t) and (p~{x.,t). 

Now we come to the key point: for the transition amplitude Tj ^_ j to 
be Lorentz invariant, the time ordering must be frame independent. The 
time ordering of two spacetime points x and x' is frame independent if 
their separation is timelike ; this means that the interval between them is 
negative, (x — x') 2 < 0. Two spacetime points whose separation is spacelike , 
(x — x') 2 > 0, can have different temporal ordering in different frames. In 
order to avoid being different in different frames, we must then require 

\Hi(x),Tti{x')\ = 0 whenever [x — x') 2 > 0 . (4-11) 

Obviously, [</? + (x), (/? + (a/)]i F = [ip~(x), ^ {x')\-^ = 0. However, 

[< p + (x),ip-(x')\ T = Jdkdk e < ( fce - fcV )[ 0 ( k ),a t (k / )] T 


= J dk e iHx ~ x,) 


m 

4ir 2 r 


K\ ( mr ) 


= C(r) . 


(4.12) 
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In the next-to-last line, we have taken (x — x') 2 = r 2 > 0, and K\ (z) is 
a modified Bessel function. (This Lorentz-invariant integral is most easily 
evaluated in the frame where t! = t.) The function C(r) is not zero for any 
r > 0. (Not even when m = 0; in this case, C(r ) = l/47r 2 r 2 .) On the other 
hand, 7ii(x) must involve both <p + (x) and p~(x), by hermiticity. Thus, 
generically, we will not be able to satisfy eq. (4.11). 

To resolve this problem, let us try using only particular linear combi- 
nations of (p + (x) and ( x ) . Define 

<P\(x) = <P + (x) + , 

t{(x) = T~(x) + A> + (x) , (4.13) 

where A is an arbitrary complex number. We then have 

[¥>A (*),<?! (*')]? = b + (®)^ _ (®')]=F + W 2 [¥>“(*)>¥ ,+ (* , )]:f 

= (1 -F |A| 2 ) C(r) (4.14) 


and 


[yx{x),yx(x')\^ = \[y + (x),y (xOIt + A^ {.x),ip + {x')\^ 

= A(1 ^ l)C(r) . (4.15) 

Thus, if we want <p\{x) to either commute or anticommute with both px{x') 
and ^p\{x') at spacelike separations, we must choose |A| = 1, and, we must 
choose commutators. Then (and only then), we can build a suitable Tti(x) 
by making it a hermitian function of <p>\(x ). 

But this has simply returned us to the theory of a real scalar field, 
because, for A = e ia , e~ ia ^ 2 ipx( x ) is hermitian. In fact, if we make the 
replacements o(k) — > e + *“/ 2 a(k) and cJ(k) — > e~ ia ^ 2 a)( k), then the com- 
mutation relations of eq. (4.2) are unchanged, and e~ ia ^ 2 <px(x) = <p(x) = 
(p + (x) +(p~(x). Thus, our attempt to start with the creation and annihila- 
tion operators cd(k) and a(k) as the fundamental objects has simply led us 
back to the real, commuting, scalar field cp(x) as the fundamental object. 

Let us return to thinking of c p(x ) as fundamental, with a lagrangian den- 
sity given by some function of the Lorentz scalars <p(x) and d^^p{x)d^Lp{x). 
Then, quantization will result in \<p{x), ^p{x')\ ^ = 0 for t = t' . If we choose 
anticommutators, then [p{x)] 2 = 0 and [<9 M </?(x)] 2 = 0, resulting in a trivial 
C that is at most linear in <p, and independent of ip. This clearly does not 
lead to the correct physics. 

This situation turns out to generalize to fields of higher spin, in any 
number of spacetime dimensions. One choice of quantization (commuta- 
tors or anticommutators) always leads to a trivial £, and so this choice 
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is disallowed. Furthermore, the allowed choice is always commutators for 
fields of integer spin, and anticommutators for fields of half-integer spin. 
If we try treating the particle creation and annihilation operators as fun- 
damental, rather than the fields, we find a situation similar to that of the 
spin-zero case, and are led to the reconstruction of a field that must obey 
the appropriate quantization scheme. 

Reference Notes 

This discussion of the spin-statistics theorem follows that of Weinberg /, 
which has more details. 

Problems 

4.1) Verify eq. (4.12). Verify its limit as m — > 0. 
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5 The LSZ Reduction Formula 

Prerequisite: 3 


Let us now consider how to construct appropriate initial and final states 
for scattering experiments. In the free theory, we can create a state of one 
particle by acting on the vacuum state with a creation operator 



1 k) = o t (k)|0) , 

(5.1) 

where 

T(k) = -iJ d 3 xe ikx d 0 <p(x) . 

(5.2) 

The vacuum state 0) 

is annihilated by every a(k), 



a(k)|0) = 0 , 

(5.3) 

and has unit norm, 

<0|0> = 1 . 

(5.4) 


The one-particle state | k) then has the Lorentz- invariant normalization 


(k\k') = (27r) 3 2w5 3 (k - k') , (5.5) 

where u = (k 2 + m 2 ) 1//2 . 

Next, let us define a time-independent operator that (in the free theory) 
creates a particle localized in momentum space near ki, and localized in 
position space near the origin: 

a\ = J d 3 k /i(k)a t (k) , (5.6) 

where 

/i(k) oc exp[— (k - ki) 2 /4cr 2 ] (5.7) 

is an appropriate wave packet, and o is its width in momentum space. 
Consider the state oj|0). If we time evolve this state in the Schrodinger 
picture, the wave packet will propagate (and spread out). The particle is 
thus localized far from the origin as t — > Too. If we consider instead a 
state of the form aja^O), where ki ^ k- 2 , then the two particles are widely 
separated in the far past. 

Let us guess that this still works in the interacting theory. One compli- 
cation is that <F(k) will no longer be time independent, and so a{ , eq. (5.6), 
becomes time dependent as well. Our guess for a suitable initial state of a 
scattering experiment is then 

l*)= lim a\(t)a\(t) |0) . 

t — > — OO 


(5.8) 
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By appropriately normalizing the wave packets, we can make (i\i) = 1, and 
we will assume that this is the case. Similarly, we can consider a final state 

I/) = » ( 5 - 9 ) 

where kj / k' 2 , and (f\f) = 1. This describes two widely separated par- 
ticles in the far future. (We could also consider acting with more creation 
operators, if we are interested in the production of some extra particles in 
the collision of two.) Now the scattering amplitude is simply given by 

We need to find a more useful expression for ( f\i ). To this end, let us 
note that 



= -i J d 3 k /i(k) J d 4 x d 0 (e lkx d 0 ip(x)j 
= — ij d 3 k /i(k) J d 4 x e lkx (dQ + u 2 )(p(x) 

= -if d 3 k /i(k) J d 4 x e ikx (dl + k 2 + m 2 V(x) 

= — i J d 3 k /i(k) J d 4 x e lkx (dQ — V 2 + m 2 )ip(x) 

= — i J d 3 k /i(k) J d 4 x e lkx (dQ — V 2 + m 2 )ip(x) 

= -ij d 3 k /i(k) J d 4 x e ikx (-d 2 + m 2 Mx) . (5.10) 

The first equality is just the fundamental theorem of calculus. To get the 
second, we substituted the definition of a\(t), and combined the d 3 x from 
this definition with the dt to get d A x. The third comes from straightforward 
evaluation of the time derivatives. The fourth uses lo 2 = k 2 + m 2 . The fifth 
writes k 2 as —V 2 acting on e* k ' x . The sixth uses integration by parts to 
move the V 2 onto the field f(x); here the wave packet is needed to avoid a 
surface term. The seventh simply identifies <9 q — V 2 as -d 2 . 

In free-field theory, the right-hand side of eq. (5.10) is zero, since <p(x) 
obeys the Klein-Gordon equation. In an interacting theory, with (say) 
L\ = ^gip 3 , we have instead (— d 2 + m 2 )(p = \g^p 2 . Thus the right-hand 
side of eq. (5.10) is not zero in an interacting theory. 

Rearranging eq. (5.10), we have 

a{(— oo) = a{(+oo) + * J d 3 k /i(k) J d 4 x e ikx (—d 2 + m 2 )Lp(x) . (5.11) 

We will also need the hermitian conjugate of this formula, which (after a 
little more rearranging) reads 

ai(+oo) = ai(— oo)+i J d 3 k /i(k) J d 4 x e~ lkx (— d 2 + rri 2 )p(x) . (5.12) 
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Let us return to the scattering amplitude, 

(f\i) = (0|ai'(+oo)a2'(+oo)a{(— oo)<4(— oo) 1 0) . (5.13) 

Note that the operators are in time order. Thus, if we feel like it, we can 
put in a time- ordering symbol without changing anything: 

(f\i) = (0|Tan(+oo)a2'(+oo)a{(— oo)c4(— oo)|0) . (5-14) 

The symbol T means the product of operators to its right is to be ordered, 
not as written, but with operators at later times to the left of those at 
earlier times. 

Now let us use eqs. (5.11) and (5.12) in eq. (5.14). The time-ordering 
symbol automatically moves all a.;/(— oo)’s to the right, where they anni- 
hilate |0). Similarly, all aJ(+oo)’s move to the left, where they annihilate 

<°l- 

The wave packets no longer play a key role, and we can take the o — > 0 
limit in eq. (5.7), so that /i(k) = 4 3 (k — ki). The initial and final states 
now have a delta-function normalization, the multiparticle generalization 
of eq. (5.5). We are left with 

(f\i) = i n + n ' J d 4 x x e ikixi (-dj + m 2 ) . . . 

d 4 x'i e ~ ife i x i (—dfi + m 2 ) . . . 

X (0|Ty?(.xi) . . . <p(x i) . . . |0) . (5.15) 

This formula has been written to apply to the more general case of n 
incoming particles and n' outgoing particles; the ellipses stand for similar 
factors for each of the other incoming and outgoing particles. 

Eq. (5.15) is the Lehmann- Symanzik-Zimmermann reduction formula, 
or LSZ formula for short. It is one of the key equations of quantum field 
theory. 

However, our derivation of the LSZ formula relied on the supposition 
that the creation operators of free field theory would work comparably in 
the interacting theory. This is a rather suspect assumption, and so we must 
review it. 

Let us consider what we can deduce about the energy and momentum 
eigenstates of the interacting theory on physical grounds. First, we assume 
that there is a unique ground state |0), with zero energy and momentum. 
The first excited state is a state of a single particle with mass m. This 
state can have an arbitrary three-momentum k; its energy is then E = 
lo = (k 2 + m 2 ) 1 / 2 . The next excited state is that of two particles. These 
two particles could form a bound state with energy less than 2 m (like the 
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Figure 5.1: The exact energy eigenstates in the (P ,E) plane. The ground 
state is isolated at (0, 0), the one-particle states form an isolated hyperbola 
that passes through (0, m), and the multi-particle continuum lies at and 
above the hyperbola that passes through (0, 2m). 

hydrogen atom in quantum electrodynamics), but, to keep things simple, let 
us assume that there are no such bound states. Then the lowest possible 
energy of a two-particle state is 2m. However, a two-particle state with 
zero total three-momentum can have any energy above 2m, because the 
two particles could have some relative momentum that contributes to their 
total energy. Thus we are led to a picture of the states of theory as shown 
in fig. (5.1). 

Now let us consider what happens when we act on the ground state 
with the field operator ip{x). To this end, it is helpful to write 

ip(x) = exp(-iP fJ, x t _ l )^(0)exp(+iP fl x fl ) , (5.16) 

where P^ L is the energy-momentum four-vector. (This equation, introduced 
in section 2, is just the relativistic generalization of the Heisenberg equa- 
tion.) Now let us sandwich ip(x) between the ground state (on the right), 
and other possible states (on the left). For example, let us put the ground 
state on the left as well. Then we have 

(0|^(*)|0> = (0|e- iP M0)e + ^|0) 

= < 0 |¥>( 0 )| 0 ) . 


(5.17) 
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To get the second line, we used P p |0) = 0. The final expression is just 
a Lorentz-invariant number. Since |0) is the exact ground state of the 
interacting theory, we have (in general) no idea what this number is. 

We would like (0|<p(0)|0) to be zero. This is because we would like 
a{(±oo), when acting on |0), to create a single particle state. We do not 
want a{(±oo) to create a linear combination of a single particle state and 
the ground state. But this is precisely what will happen if (0|y>(0)|0) is not 
zero. 

So, if v = (0|<p(0)|0) is not zero, we will shift the field <p(x) by the 
constant v. This means that we go back to the lagrangian, and replace 
c p(x ) everywhere by tp(x) + v. This is just a change of the name of the 
operator of interest, and does not affect the physics. However, the shifted 
<p(x) obeys, by construction, (0|<p(x)|0) = 0. 

Let us now consider (p|<p(a;)|0), where | p) is a one-particle state with 
four-momentum p, normalized according to eq. (5.5). Again using eq. (5.16), 
we have 


(p\<p(x)\0) = (p\e~ iPx ip(0)e +iPx \0) 

= e~ ipx (p|<p(0)|0) , (5.18) 

where (p|<p(0)|0) is a Lorentz-invariant number. It is a function of p, but 
the only Lorentz-invariant functions of p are functions of p 2 , and p 2 is just 
the constant — m 2 . So (p|<p(0)|0) is just some number that depends on m 
and (presumably) the other parameters in the lagrangian. 

We would like (p|<p(0)|0) to be one. That is what it is in free-held theory, 
and we know that, in free-held theory, a|(±oo) creates a correctly normal- 
ized one-particle state. Thus, for a{(±oo) to create a correctly normalized 
one-particle state in the interacting theory, we must have (p|<p(0)|0) = 1. 

So, if (p|<p(0)|0) is not equal to one, we will rescale (or, one might say, 
renormalize) <p{x) by a multiplicative constant. This is just a change of the 
name of the operator of interest, and does not affect the physics. However, 
the rescaled <p{x) obeys, by construction, (p|<p(0)|0) = 1. 

Finally, consider {p,n\<p(x)\0}, where \p, n) is a multiparticle state with 
total four-momentum p, and n is short for all other labels (such as relative 
momenta) needed to specify this state. We have 

(p, n\ip(x)\0) = (p,n\e- iPx v(0)e +iPx \0) 

= e-^(p,n|^(0)|0) 

= e~ ipx A n (p) , (5.19) 

where A n (p) is a function of Lorentz invariant products of the various 
(relative and total) four-momenta needed to specify the state. Note that, 
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from fig. (5.1), p° = (p 2 + Af 2 ) 1 / 2 with M > 2m. The invariant mass M is 
one of the parameters included in the set n. 

We would like (p, n|<y?(x)|0) to be zero, because we would like a{(±oo), 
when acting on 1 0) , to create a single particle state. We do not want 
4 (Too) to create any multiparticle states. But this is precisely what may 
happen if (p,n\(p(x)\0) is not zero. 

Actually, we are being a little too strict. We really need (p, ro|a{(±oo)|0) 
to be zero, and perhaps it will be zero even if (p,n\(p(x)\0) is not. Also, we 
really should test a{(±oo)|0) only against normalizable states. Mathemat- 
ically, non-nornralizable states cause all sorts of trouble; mathematicians 
don’t consider them to be states at all. In physics, this usually doesn’t 
bother us, but here we must be especially careful. So let us write 

10) = E/ d3 P On(p)|p,n) , (5.20) 


where the v/i n (p)’s are wave packets for the total three-momentum p. Note 
that eq. (5.20) is highly schematic; the sum over n includes integrals over 
continuous parameters like relative momenta. 

Now we want to examine 


(0|4(0I°) = “*E / 4p0n( P) [ d 3 k f i(k) f d 3 x e lkx d 0 (p,n\<p(x 

n (5.21) 

We will take the limit t — > Too in a moment. Using eq. (5.19), eq. (5.21) 
becomes 


14(01°) = -*E/ d 3 pipn( P) J d 3 k /i(k) J d 3 x(e lkx d 0 e zpx ^jA n ( p) 

= £/ 4C(P) / d 3 kf i(k) J d 3 x (p°+k°)e i ( k - p ' >x A n (p) . 
n (5.22) 


Next we use / d 3 xe’d k p ^' x = (27r) 3 (5 3 (k — p) to get 


(M(t)\0) = E f d 3 P (27r) 3 (p 0 +fc°)C(p)/i(p)^ (p)e i(p0 “ fc0)t , (5-23) 

n J 


where p° = (p 2 + Af 2 ) 1 / 2 and k (> = (p 2 + m 2 ) 1 / 2 . 

Now comes the key point. Note that p° is strictly greater than k°, 
because M > 2 m > m. Thus the integrand of eq. (5.23) contains a phase 
factor that oscillates more and more rapidly as t — ► Too. Therefore, by 
the Riemann-Lebesgue lemma , the right-hand side of eq. (5.23) vanishes as 
t — > Too. 
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Physically, this means that a one-particle wave packet spreads out dif- 
ferently than a multiparticle wave packet, and the overlap between them 
goes to zero as the elapsed time goes to infinity. Thus, even though our 
operator a{ (t) creates some multiparticle states that we don’t want, we 
can “follow” the one-particle state that we do want by using an appropri- 
ate wave packet. By waiting long enough, we can make the multiparticle 
contribution to the scattering amplitude as small as we like. 

Let us recap. The basic formula for a scattering amplitude in terms of 
the fields of an interacting quantum field theory is the LSZ formula, which 
is worth writing down again: 

(f\i) = i n + n ' J d 4 Xl e ikixi (-dl + m 2 ) . . . 

d A x v e- ik '^{-dl+m 2 )... 
x (0 |T<£>(xi) . . . <p(x i) . . . |0) . (5.24) 

The LSZ formula is valid provided that the field obeys 

(0|(/?(x)|0) = 0 and 1 0) = e~ lkx . (5.25) 

These normalization conditions may conflict with our original choice of field 
and parameter normalization in the lagrangian. Consider, for example, a 
lagrangian originally specified as 

C = — \ m 2 ip 2 + gflV 3 • (5.26) 

After shifting and rescaling (and renaming some parameters), we will have 
instead 

C = - ^Z^ipd^ip - \Z m m 2 gr + \Z g g<p 3 + Yip . (5.27) 

Here the three Z's and Y are as yet unknown constants. They must be 
chosen to ensure the validity of eq. (5.25); this gives us two conditions in 
four unknowns. We fix the parameter rn by requiring it to be equal to the 
actual mass of the particle (equivalently, the energy of the first excited state 
relative to the ground state) , and we fix the parameter g by requiring some 
particular scattering cross section to depend on g in some particular way. 
(For example, in quantum electrodynamics, the parameter analogous to g 
is the electron charge e. The low-energy Coulomb scattering cross section 
is proportional to e , with a definite constant of proportionality and no 
higher-order corrections; this relationship defines e.) Thus we have four 
conditions in four unknowns, and it is possible to calculate Y and the three 
Z's order by order in powers of g. 

Next, we must develop the tools needed to compute the correlation 
functions (0|T</?(xi) . . . |0) in an interacting quantum field theory. 
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Reference Notes 

Useful discussions of the LSZ reduction formula can be found in Brown, 
Itzykson & Zuber , Peskin & Schroeder, and Weinberg I. 

Problems 

5.1) Work out the LSZ reduction formula for the complex scalar field that 
was introduced in problem 3.5. Note that we must specify the type 
(a or b) of each incoming and outgoing particle. 
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6 Path Integrals in Quantum Mechanics 

Prerequisite: none 

Consider the nonrelativistic quantum mechanics of one particle in one di- 
mension; the hamiltonian is 

H(P,Q) = ^P 2 + V(Q), (6.1) 

where P and Q are operators obeying [Q,P] = i ■ (We set h = 1 for 
notational convenience.) We wish to evaluate the probability amplitude for 
the particle to start at position q' at time t' , and end at position q" at time 
t" . This amplitude is {q" \e~ lH ^ _t )|</), where | q') and \q") are eigenstates 
of the position operator Q. 

We can also formulate this question in the Heisenberg picture, where op- 
erators are time dependent and the state of the system is time independent, 
as opposed to the more familiar Schrodinger picture. In the Heisenberg pic- 
ture, we write Q(t ) = e lHt Qe~ lHt . We can then define an instantaneous 
eigenstate of Q(t) via Q(t)\q,t) = q\q,t). These instantaneous eigenstates 
can be expressed explicitly as | q,t) = e +lHt \ q), where Q\q) = q\q). Then 
our transition amplitude can be written as {q" . t"\q' , t') in the Heisenberg 
picture. 

To evaluate (q", t”\q',t'), we begin by dividing the time interval T = 
t" — t! into N + 1 equal pieces of duration St = T / (IV + 1). Then introduce 
N complete sets of position eigenstates to get 

N 

n dqj (q"\e- iHSt \q N )(q N \e~ iHSt \q N - 1 ) . . . (qi\e- iHSt \q') . 

3 = 1 

(6-2) 

The integrals over the q* s all run from — oo to +oo. 

Now consider (q 2 \e~ iHSt \qi)- We can use the Campbell-Baker-Hausdorf 
formula 

exp(A + B) = exp(A) exp(R) exp(— ^[A, B] + . . .) (6-3) 

to write 

exp (—iHSt) = exp[— i(St/2m)P 2 ] exp[— i8tV(Q)] exp[0(dt 2 )] . (6.4) 

Then, in the limit of small St. we should be able to ignore the final expo- 
nential. Inserting a complete set of momentum states then gives 

{q 2 \e~ iH5t \qi) = j d Pl (q 2 \e~ iW2m)p2 \pi)(pi\e- iStV ^\ qi ) 

= J d Pl e -iStv {qi ) {q2 \ pi){pi \ qi) 
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e -i(6t/2m)pf e ~iStV(q i) e *pi(g 2 -<?i) 
e -iH(pi,qi)St e ipi(q 2 -qi) 


(6.5) 


To get the third line, we used (q\p) = (27r) -1/2 exp (ipq). 

If we happen to be interested in more general hamiltonians than eq. (6.1), 
then we must worry about the ordering of the P and Q operators in any 
term that contains both. If we adopt Weyl ordering, where the quantum 
hamiltonian H [P, Q ) is given in terms of the classical hamiltonian H (p, q) 
by 

H(P, Q)= f ( ^L d A e ixP+ikQ j dp dq e -ixp-ikq ; ( 6 .6) 

J 2ir 2ir J 

then eq. (6.5) is not quite correct; in the last line, H(pi,q\) should be 
replaced with H(pi,q\), where q\ = \{q\ + 52 ) • For the hamiltonian of 
eq. (6.1), which is Weyl ordered, this replacement makes no difference in 
the limit 5t — > 0. 

Adopting Weyl ordering for the general case, we now have 


(q'i t"\q' , t') 



e -iH( Pj ,qj)5t 


(6.7) 


where q.j = \{qj + qj+i), qo = q', and QN+ 1 = q" ■ If we now define <jj = 
(qj + 1 — qj)/St , and take the formal limit of 5t — > 0, we get 


{q",t"\q',t') = J VqVp exp if dt(p{t)q{t ) - H(p(t), 


( 6 . 8 ) 


The integration is to be understood as over all paths in phase space that 
start at q(t') = q' (with an arbitrary value of the initial momentum) and 
end at q(t") = q" (with an arbitrary value of the final momentum). 

If H (p, q) is no more than quadratic in the momenta [as is the case for 
eq. (6.1)], then the integral over p is gaussian, and can be done in closed 
form. If the term that is quadratic in p is independent of q [as is the case 
for eq. (6.1)], then the prefactors generated by the gaussian integrals are 
all constants, and can be absorbed into the definition of Vq. The result of 
integrating out p is then 


{q" ,t”\q' ,t') = J Vq exp ij dt L(q(t),q{t)) 


(6.9) 


where L(q, q) is computed by first finding the stationary point of the p 
integral by solving 



(pq~H(p, qfj 


dH(p,q ) 
dp 


(6.10) 
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for p in terms of q and q, and then plugging this solution back into pq — H 
to get L. We recognize this procedure from classical mechanics: we are 
passing from the hamiltonian formulation to the lagrangian formulation. 

Now that we have eqs. (6.8) and (6.9), what are we going to do with 
them? Let us begin by considering some generalizations; let us examine, 
for example, (q" , t"\Q(ti)\q' , f'), where t’ <t\ < t" . This is given by 

{q" ,t"\Q(ti)\q' ,t') = {q"\e- lH ^-^Qe- iH ^-^\q') . (6.11) 

In the path integral formula, the extra operator Q inserted at time t\ will 
simply result in an extra factor of q{t\). Thus 

{q p"\Q{t\)\q p’) = j VpVq q(h)e lS , (6.12) 


where S = f[, dt (pq — H). Now let us go in the other direction; consider 
/ VpVq q(ti)q(t 2 )e lS . This clearly requires the operators Q(ti) and Q(t. 2 ), 
but their order depends on whether t\ < t ,2 or t 2 <t\. Thus we have 

J VpVq q(ti)q(t 2 ) e lS = (q" , t"\TQ(ti)Q(t 2 )\q' , t') . (6.13) 


where T is the time ordering symbol : a product of operators to its right is 
to be ordered, not as written, but with operators at later times to the left 
of those at earlier times. This is significant, because time-ordered products 
enter into the LSZ formula for scattering amplitudes. 

To further develop these methods, we need another trick: functional 
derivatives. We define the functional derivative 5/6 f(t ) via 

f(t 2 ) = 6(ti - t 2 ) , (6.14) 

where 5{t ) is the Dirac delta function. Also, functional derivatives are 
defined to satisfy all the usual rules of derivatives (product rule, chain 
rule, etc). Eq. (6.14) can be thought of as the continuous generalization of 
(d/dxQxj = Sij. 

Now, consider modifying the lagrangian of our theory by including ex- 
ternal forces acting on the particle: 

H(p, q) -»• H(p, q) - f(t)q(t) - h(t)p(t ) , (6.15) 


where f(t) and h{t ) are specified functions. In this case we will write 


, r r t" 

(q", t"\q t') f. h = j VpVq exp i J dt [pq - H + fq + hp^j 


. (6.16) 
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where H is the original hamiltonian. Then we have 


J 

II 

1—1 | 

j VpVqq(t 1 )e i I dt ^- H+ f q+hp \ , 

1 d 1 d / // ,//i / ,/\ 

* df(h) i df(t 2 ) {q ’ |c? ’ )f ’ h - u 

j VpVqq(t 1 )q(t 2 )e i I dtlpq - H+fq+hp] , 

j 

II 

-t-i 

-t-i 

H | ' 

j VpVq p(t{) e* S ^te-n+fi+hp] , 


(6.17) 


and so on. After we are done bringing down as many factors of q(ti) or 
p(ti) as we like, we can set f(t) = h(t ) = 0, and return to the original 
hamiltonian. Thus, 


<<?V' \TQ(t 1 )...P(t n )...\ q ',t l ) 


1 <5 1 5 

i Sf(ti) " ‘ i 6h(t n ) ' 




f=h = 0 


(6.18) 


Suppose we are also interested in initial and final states other than 
position eigenstates. Then we must multiply by the wave functions for 
these states, and integrate. We will be interested, in particular, in the 
ground state as both the initial and final state. Also, we will take the 
limits t! — ► — oo and t" — > +oo. The object of our attention is then 


(0|°)/,fc = lim f dq" dq' (q",t"\q , ,t , ) fA 'i/j 0 (q') , (6.19) 

t'^-oo J 

t ,f — *-+oo 

where ipo(q) = (<?|0) is the ground-state wave function. Eq. (6.19) is a 
rather cumbersome formula, however. We will, therefore, employ a trick to 
simplify it. 

Let | n) denote an eigenstate of H with eigenvalue E n . We will suppose 
that Eq = 0; if this is not the case, we will shift H by an appropriate 
constant. Next we write 


q',t’) = e iHt '\q) 

OO 

= Y^ elHt '\ n )( n W) 

n = 0 
oo 

= X) '•l’nW) e ' Ent ' \ n ) > ( 6 - 20 ) 

?i=0 


where il> n {q) = {q\n) is the wave function of the nth eigenstate. Now, 
replace H with (1 —ie)H in eq. (6.20), where e is a small positive infinites- 
imal. Then, take the limit t' — > — oo of eq. (6.20) with e held fixed. Every 
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state except the ground state is then multiplied by a vanishing exponential 
factor, and so the limit is simply V , o(l/ / )|0)- Next, multiply by an arbi- 
trary function %((/), and integrate over q' . The only requirement is that 
(0|x) Q 0. We then have a constant times |0), and this constant can be 
absorbed into the normalization of the path integral. A similar analysis of 
{q",t"\ = (q" \e~ lHt shows that the replacement H — ► (1— ie)H also picks 
out the ground state as the final state in the t" — * +oo limit. 

What all this means is that if we use (1 —ie)H instead of H, we can be 
cavalier about the boundary conditions on the endpoints of the path. Any 
reasonable boundary conditions will result in the ground state as both the 
initial and final state. Thus we have 

(0|0) = J VpVq ex p i J dtt (pq — (1— ie)H + fq + hp^j . (6.21) 

Now let us suppose that H = Hq X Hi, where we can solve for the 
eigenstates and eigenvalues of Hq, and Hi can be treated as a perturbation. 
Suppressing the ie, eq. (6.21) can be written as 

(0|0) M = j VpVq exp i J dt (pq - H 0 (p , q) - HQp, q) + fq + hp ) 

r . r +oc , TT (i s i s \i 

= exp rl oo 

x J VpVq exp i J dt (pq — H^ip. q) + fq + hp'j . (6.22) 

To understand the second line of this equation, take the exponential prefac- 
tor inside the path integral. Then the functional derivatives (that appear 
as the arguments of HQ just pull out appropriate factors of p(t) and q(t), 
generating the right-hand side of the first line. We assume that we can 
compute the functional integral in the second line, since it involves only 
the solvable hamiltonian Hq. The exponential prefactor can then be ex- 
panded in powers of Hi to generate a perturbation series. 

If Hi depends only on q (and not on p), and if we are only interested 
in time-ordered products of Q ’ s (and not P’s), and if H is no more than 
quadratic in P, and if the term quadratic in P does not involve Q, then 
eq. (6.22) can be simplified to 

r r +oc b X 

<oio)/ — ex p|*y_^ «i( 7m )j 

x j T>q exp i J dt (L 0 (q, q) + fq) ■ (6.23) 


where LQq) = —HQq). 
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Reference Notes 

Brown and Ramond I have especially clear treatments of various aspects 
of path integrals. For a careful derivation of the midpoint rule of eq. (6.7), 
see Berry & Mount. 


Problems 

6.1) a) Find an explicit formula for Vq in eq. (6.9). Your formula should 
be of the form Vq = CYlf = \dqj, where C is a constant that you 
should compute. 

b) For the case of a free particle, V ( Q ) = 0, evaluate the path integral 
of eq. (6.9) explicitly. Hint: integrate over qi, then q 2 , etc, and look 
for a pattern. Express you final answer in terms of q ' , t', q " , t” , and 
m. Restore fi by dimensional analysis. 

c) Compute (q" , t"\q', t') = (q"\e~ lHl ' t " ^ t '' 1 \q') by inserting a complete 
set of momentum eigenstates, and performing the integral over the 
momentum. Compare with your result in part (b). 
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7 The Path Integral for the Harmonic 

Oscillator 

Prerequisite: 6 


Consider a harmonic oscillator with hamiltonian 


H(P,Q) = ^P 2 P\mu 2 Q 2 . (7.1) 


We begin with the formula from section 6 for the ground state to ground 
state transition amplitude in the presence of an external force, specialized 
to the case of a harmonic oscillator: 


(0|0)/ = / VpDq exp?' 


r*+QO 


dt 


pq - (1 -ie)H + fq 


(7.2) 


Looking at eq. (7.1), we see that multiplying H by 1—ie is equivalent to 
the replacements mr 1 — > (1— ie)m _1 [or, equivalently, m — > (l-f-ie)m] and 
mu 2 — > (1— ie)rriL 0 2 . Passing to the lagrangian formulation then gives 


< 0 | 0 >/ 


j Dq exp i 



dt 


1(1 +ie)mq 


•2 


^(1— ie)muj 2 q 2 + fq 


From now on, we will simplify the notation by setting m = 1. 
Next, let us use Fourier-transformed variables, 

/ Too n Too 

dt e iEt q(t) , q(t)= — e~ iEt q(E) . 

-oo J— oo 


(7.3) 


(7.4) 


The expression in square brackets in eq. (7.3) becomes 


1 r+°° dE dE' 


,-i(E+E')t 


2 J- r 


27 r 27 t 


(-(1 +ie)EE' - (1 -ie)u 2 )q(E)q(E') 


+ f(E)q(E')+f(E')q(E) 


(7.5) 


Note that the only t dependence is now in the prefactor. Integrating over t 
then generates a factor of 2tt6(E + E'). Then we can easily integrate over 
E' to get 


S = 



1 r+°° dE 

2 J- oo 27T 


((l+ie)E 2 - (l-ie)u; 2 )q(E)q(-E) 


+ f(E)q(-E) + f(-E)q(E) . 


(7.6) 
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The factor in large parentheses is equal to E 2 — u 2 + i(E 2 + w 2 )e, and we 
can absorb the positive coefficient into e to get E 2 — to 2 + ie. 

Now it is convenient to change integration variables to 


x(E) 


q(E) + 


m 

E 2 -u 2 + ie ' 


(7.7) 


Then we get 


S 


1 r+°° dE 

2 J- oo 27T 


x{E){E 2 


uj 2 + ie)x(-E) 


mn-E) ' 

E 2 — uj 2 + ie ' 


(7.8) 


Furthermore, because eq. (7.7) is just a shift by a constant, Vq = Vx. Now 
we have 


(0|0)/ = exp 


i_ r+°° dE f(E)f(-E) 

2 7-00 


27t — E 2 + in 2 — ie 
+°° dFi 


[ Vx exp x(E)(E 2 — co 2 + ie)x(—E) 

J L2 J - oo 27T 


(7.9) 


Now comes the key point. The path integral on the second line of 
eq. (7.9) is what we get for (0|0)/ in the case / = 0. On the other hand, 
if there is no external force, a system in its ground state will remain in its 
ground state, and so (0| 0) /=o = 1- Thus (0| 0) j is given by the first line of 
eq. (7.9), 


(0|0)j = exp 


1 r+°° dE 

2 J- oo 27 T 


mn-E) 

— E 2 + uj 2 — ie 


(7.10) 


We can also rewrite (0|0)/ in terms of time-domain variables as 


(0|0) / = exp 


r+oo 


2j-c 


dtdt' f{t)G(t — t')f{t') 


where 


r+oo fjjr P -iE(t-t') 


(7.11) 


(7.12) 


j- oo 27 t — E 2 + uj 2 — ie 
Note that G(t—t') is a Green’s function for the oscillator equation of motion: 


dt. 2 


+ u) 2 J G(t — t ') = 5(t — t ') . 


(7.13) 


This can be seen directly by plugging eq. (7.12) into eq. (7.13) and then 
taking the e — ► 0 limit. We can also evaluate G(t — tf) explicitly by treating 
the integral over E on the right-hand side of eq. (7.12) as a contour integral 
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in the complex E plane, and then evaluating it via the residue theorem. 
The result is 

G(t — t') = — expf— iu\t — i 7 )') . (7-14) 

Consider now the formula from section 6 for the time-ordered product 
of operators. In the case of initial and final ground states, it becomes 

<°| T Q(«,) • • • |0> = I jjf-j ■ • • (W/l^ • (W5) 

Using our explicit formula, eq. (7.11), we have 

<0|TQ(t,)Q(t 2 )|0) = i i - 4 - (010); 

i 5f(t i) i df(t 2 ) f = o 

I V f r+oo 1 

= -^777T / dt'G(t 2 -t')f(t’)m f 

l 0J(tl) U-oo J f=° 

= jG(t 2 - ti) + (term with /’ s) (0|0)/ 

= \G{t 2 -t l ). (7.16) 

We can continue in this way to compute the ground-state expectation value 
of the time-ordered product of more Q(t)’ s. If the number of Q(t)’s is odd, 
then there is always a left-over f(t) in the prefactor, and so the result is 
zero. If the number of Q(f)’s is even, then we must pair up the functional 
derivatives in an appropriate way to get a nonzero result. Thus, for exam- 
ple, 

{0\TQ(ti)Q(t 2 )Q(t 3 )Q(t4)\0} = \\G{ti-t 2 )G(t 3 -t A ) 

l z L 

+ G(ti—t 3 )G(t 2 —t^) 

+ G{t\— t±)G(t 2 — 1 3 ) . (7.17) 

More generally, 

(0|TQ(f 1 )...Q(f 2n )|0) = l £ G(t il -t i2 )...G(t i2n _ 1 -t i2n ). (7.18) 

pairings 

Problems 

7.1) Starting with eq. (7.12), do the contour integral to verify eq. (7.14). 

7.2) Starting with eq. (7.14), verify eq. (7.13). 
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7.3) a) Use the Heisenberg equation of motion, A = i [H. A], to find explicit 
expressions for Q and P. Solve these to get the Heisenberg-picture 
operators Q(t) and P(t) in terms of the Schrodinger picture operators 
Q and P. 

b) Write the Schrodinger picture operators Q and P in terms of the 

creation and annihilation operators a and a\ where H = a-\-\). 

Then, using your result from part (a), write the Heisenberg-picture 
operators Q{t ) and P(t) in terms of a and a) . 

c) Using your result from part (b), and a|0) = (0|a^ = 0, verify 
eqs. (7.16) and (7.17). 

7.4) Consider a harmonic oscillator in its ground state at t = — oo. It is 
then then subjected to an external force f(t). Compute the probabil- 
ity 1(010)/ 1 2 that the oscillator is still in its ground state at t = +oo. 
Write your answer as a manifestly real expression, and in terms of 
the Fourier transform f(E) = dte lEt f(t,). Your answer should 
not involve any other unevaluated integrals. 
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8 The Path Integral for Free Field Theory 

Prerequisite: 3, 7 

Our results for the harmonic oscillator can be straightforwardly generalized 
to a free field theory with hamiltonian density 

Ho = \P 2 + \{Vip) 2 + \rr?ip 2 . (8.1) 

The dictionary we need is 

q(t) — > y>(x, t) (classical field) 

Q(t) — > y>(x, t) (operator field) 

f(t) — * J(x, t) (classical source ) (8-2) 

The distinction between the classical held <p(x) and the corresponding op- 
erator held should be clear from context. 

To employ the e trick, we multiply TL o by 1 — ie. The results are equiv- 
alent to replacing m 2 in TLq with m 2 — ie. From now on, for notational 
simplicity, we will write m 2 when we really mean m 2 — ie. 

Let us write down the path integral (also called the functional integral ) 
for our free held theory: 

Z 0 (J) = (0|0)j = J V<p e ifd 4 x[Co+M > ( 8 .3) 

where 

C 0 = -^ipd^tp - \m 2 g) 2 ( 8 . 4 ) 

is the lagrangian density, and 

Vip oc ]^[ d(p(x) ( 8 . 5 ) 

X 

is the functional measure. Note that when we say path integral , we now 
mean a path in the space of held configurations. 

We can evaluate Zo(J) by mimicking what we did for the harmonic 
oscillator in section 7 . We introduce four-dimensional Fourier transforms, 

m = J d A x e~ ik * *>(x) , *>(x) = j(0yi e ikx W) , (8.6) 

where kx = —k°t + k-x, and k° is an integration variable. Then, starting 
with So = f d 4 x [Cq + J ip], we get 

So = -J ^4 [-<p(k)(k 2 +m 2 )(p(-k) + J{k)<p(-k) + J(-k)(p(k)\ , ( 8 . 7 ) 
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where k 2 = k 2 — (A; 0 ) 2 . We now change path integration variables to 

m 


X{k) = <p(k) - 


k 2 + m 2 


( 8 . 8 ) 


Since this is merely a shift by a constant, we have T> = T >\ . The action 
becomes 


Sn = 


1 


d 4 k 


J(k)J(-k) 

k 2 




~ X(k)(k 2 +m 2 )x{~k) 


(8.9) 


Just as for the harmonic oscillator, the integral over % simply yields a factor 
of Zq{ 0) = (0|0)j=o = 1. Therefore 


Zq{J) = exp 


* r d 4 k J{k)J{-k ) 

2 J (27 r) 4 k 2 + m 2 — ie 


^ J d 4 x d A x J{x) A(x — x')J{x') 


= exp | 

Here we have defined the Feynman propagator, 

d 4 k e ik(x~ x ') 


A(x — x') = j 


(27 r) 4 k 2 + m 2 — ie 


(8.10) 


( 8 . 11 ) 


The Feynman propagator is a Green’s function for the Klein-Gordon equa- 
tion, 

(—d 2 + m 2 ) A(x — x') = J 4 (a: — x') . (8.12) 

This can be seen directly by plugging eq. (8.11) into eq. (8.12) and then 
taking the e — > 0 limit. We can also evaluate A(x — x') explicitly by 
treating the k° integral on the right-hand side of eq. (8.11) as a contour 
integral in the complex k° plane, and then evaluating it via the residue 
theorem. The result is 

A(x - x') = ijdk e *-(x-x')-Ht-t'| 

= J dk e ik ^- x,) + ie(t'-t) J dke- ik ^- x,) , (8.13) 

where 6(t ) is the unit step function. The integral over dk can also be 
performed in terms of Bessel functions; see section 4. 

Now, by analogy with the formula for the ground-state expectation 
value of a time-ordered product of operators for the harmonic oscillator, 
we have 

1 8 


(0|T<p(xi)...|0) = - 


i 8J(x i) 


...Z 0 (J) 


j = o 


(8.14) 
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Using our explicit formula, eq. (8.10), we have 

(onv^M^lo) = 7«57)la7^) Zo ( J )l.o 

= 7 sJW) [/ A ' A(l2 - x ' )J(x " > ] Z|)(J) L 

= }A(x 2 — xi) + (term with J’s) Zq(J) 

= }A(x 2 — xi) . (8.15) 

We can continue in this way to compute the ground-state expectation value 
of the time-ordered product of more cp’s. If the number of <^’s is odd, then 
there is always a left-over J in the prefactor, and so the result is zero. If 
the number of ip’s is even, then we must pair up the functional derivatives 
in an appropriate way to get a nonzero result. Thus, for example, 

(0|T^(xi)<^(x2)^(x3)^(x 4 )|0) = -tL [A(xi— x 2 )A(x 3 — x 4 ) 

i z L 

+ A(xi-x 3 )A(x 2 -x 4 ) 

+ A(xi-x 4 )A(x 2 -x 3 ) . (8.16) 

More generally, 

(0|T</?(xi) . . . <^(x 2n )|0) = — Y, A(x il -x i2 )... A(x i2n _ 1 -x i2 „) . (8.17) 

pairings 

This result is known as Wick’s theorem. 

Problems 

8.1) Starting with eq. (8.11), verify eq. (8.12). 

8.2) Starting with eq. (8.11), verify eq. (8.13). 

8.3) Starting with eq. (8.13), verify eq. (8.12). Note that the time deriva- 
tives in the Klein-Gordon wave operator can act on either the field 
(which obeys the Klein-Gordon equation) or the time-ordering step 
functions. 

8.4) Use eqs. (3.19), (3.29), and (5.3) (and its hermitian conjugate) to 
verify the last line of eq. (8.15). 

8.5) The retarded and advanced Green’s functions for the Klein-Gordon 
wave operator satisfy A re t (x — y) = 0 for x° > y° and A a( j v (x — y) = 0 
for x° < y°. Find the pole prescriptions on the right-hand side of 
eq. (8.11) that yield these Green’s functions. 
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8.6) Let Zq(J) = exp iWfiJ), and evaluate the real and imaginary parts 
of W 0 (J). 

8.7) Repeat the analysis of this section for the complex scalar field that was 
introduced in problem 3.5, and further studied in problem 5.1. Write 
your source term in the form J^(p+ and find an explicit formula, 
analogous to eq. (8.10), for Zq(J\J). Write down the appropriate 
generalization of eq. (8.14), and use it to compute (0 |T<^(xi)<^>(x2)|0), 
(0|T^(xi)<^(x 2)|0), and (0 |T^(.ti)^(x 2)|0). Then verify your re- 
sults by using the method of problem 8.4. Finally, give the appropri- 
ate generalization of eq. (8.17). 

8.8) A harmonic oscillator (in units with m = h = 1) has a ground-state 
wave function (g|0) oc e~ uq / 2 . Now consider a real scalar field g>(x), 
and define a field eigenstate |A) that obeys 

7 >(x, 0 )|A) = A(x)|A) , (8.18) 


where the function A(x) is everywhere real. For a free-held theory 
specified by the hamiltonian of eq. (8.1), Show that the ground-state 
wave functional is 


(A|0) oc exp 




(8.19) 


where A(k) = / d 3 xe ,k x A(x) and co(k) = (k 2 + m 2 ) 1 / 2 . 
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9 The Path Integral for Interacting Field 

Theory 

Prerequisite: 8 

Let us consider an interacting quantum field theory specified by a la- 
grangian of the form 

C = - \Z v d^ipdfj,ip - \Z m rn 2 <p 2 + \Z 9 gip z + Yip . (9.1) 

As we discussed at the end of section 5, we fix the parameter rn by requiring 
it to be equal to the actual mass of the particle (equivalently, the energy 
of the first excited state relative to the ground state), and we fix the pa- 
rameter g by requiring some particular scattering cross section to depend 
on g in some particular way. (We will have more to say about this after we 
have learned to calculate cross sections.) We also assume that the field is 
normalized by 

(0|(/j(x)|0) = 0 and (fc|</?(a;)|0) = e~ lkx . (9.2) 

Here |0) is the ground state, normalized via (0|0) = 1, and | k) is a state of 
one particle with four- momentum k ,J \ where k 2 = k^k^ L = —m 2 , normalized 
via 

(k'\k) = (27r) 3 2fc 0 <5 3 (k / - k) . (9.3) 

Thus we have four conditions (the specified values of m, g, (0|<^|0), and 
(fc|<^|0)), and we will use these four conditions to determine the values of 
the four remaining parameters ( Y and the three Z' s) that appear in C. 

Before going further, we should note that this theory (known as (p 3 
theory, pronounced “phi-cubed”) actually has a fatal flaw. The hamiltonian 
density is 

U = \Z~ l Tf 2 -Yip + \Z m m 2 p> 2 - lZ g g(p 3 . (9.4) 

Classically, we can make this arbitrarily negative by choosing an arbitrarily 
large value for <p. Quantum mechanically, this means that this hamiltonian 
has no ground state. If we start off near <p = 0, we can tunnel through the 
potential barrier to large p , and then “roll down the hill”. However, this 
process is invisible in perturbation theory in g. The situation is exactly 
analogous to the problem of a harmonic oscillator perturbed by a q 3 term. 
This system also has no ground state, but perturbation theory (both time 
dependent and time independent) does not “know” this. We will be inter- 
ested in eq. (9.1) only as an example of how to do perturbation expansions 
in a simple context, and so we will overlook this problem. 

We would like to evaluate the path integral for this theory, 

Z(J) = (0|0)j = J Vip e i i d*x[Co+C l+ M _ 


(9.5) 
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We can evaluate Z(J) by mimicking what we did for quantum mechanics 
at the end of section 6. Specifically, we can rewrite eq. (9.5) as 

Z(J) = J pfd-^o+M _ 

ex z 0 (J) , (9.6) 

where Zq(J) is the result in free- field theory, 

Zq(J) = exp - J d 4 x d 4 x' J(x)A(x — x') J(x') . (9-7) 

We have written Z(J) as proportional to (rather than equal to) the right- 
hand side of eq. (9.6) because the e trick does not give us the correct overall 
normalization; instead, we must require Z{ 0) = 1, and enforce this by hand. 
Note that, in eq. (9.7), we have implicitly assumed that 

£ 0 = ~\d^(pd^ - \m 2 p 2 , (9.8) 

since this is the Cq that gives us eq. (9.7). Therefore, the rest of C must be 
included in C\ . We write 

£ i = + £ct , 

Cct = -^Z^-Vjd^ipd^ - \{Z m -T)rn 2 (p 2 + Yip , (9.9) 

where C c t is called the counterterm lagrangian. We expect that, as g — ► 0, 
Y — > 0 and Z % —> 1. In fact, as we will see, Y = 0(g ) and Zj = 1 + 0(g 2 ). 

In order to make use of eq. (9.7), we will have to compute lots and lots of 
functional derivatives of Zq{J). Let us begin by ignoring the counterterms. 
We define 

Z\{J) oc exp ^ Z g g j d 4 x | JJJ~ j z o(J) , (9.10) 


where the constant of proportionality is fixed by -Zi(O) = 1. We now make 
a dual Taylor expansion in powers of g and J to get 




P\ I 2 


J d\jd A z J(y)A(y-z)J(z) . (9.11) 


If we focus on a term in eq. (9.11) with particular values of V and P, then 
the number of surviving sources (after we take all the functional derivatives) 
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S = 2 3 S = 2 x 3! 


Figure 9.1: All connected diagrams with E = 0 and V = 2. 






S = 4! 


Figure 9.2: All connected diagrams with E = 0 and V = 4. 


is E = 2 P — 3V. (Here E stands for external, a terminology that should 
become clear by the end of the next section; V stands for vertex and P for 
propagator.) The overall phase factor of such a term is then i}' (1 i p = 
jV+E- p , and the 3V functional derivatives can act on the 2 P sources in 
(2P)\/ {2P—3V)\ different combinations. However, many of the resulting 
expressions are algebraically identical. 

To organize them, we introduce Feynman diagrams. In these diagrams, 
a line segment (straight or curved) stands for a propagator iA (x—y), a 
filled circle at one end of a line segment for a source i J d 4 xJ(x), and a 
vertex joining three line segments for iZ^g J d^x. Sets of diagrams with 
different values of E and V are shown in figs. (9.1-9.11). 

To count the number of terms on the right-hand side of eq. (9.11) that 
result in a particular diagram, we first note that, in each diagram, the num- 
ber of lines is P and the number of vertices is V. We can rearrange the 
three functional derivatives from a particular vertex without changing the 
resulting diagram; this yields a counting factor of 3! for each vertex. Also, 
we can rearrange the vertices themselves; this yields a counting factor of 
VI. Similarly, we can rearrange the two sources at the ends of a particular 
propagator without changing the resulting diagram; this yields a counting 
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factor of 2! for each propagator. Also, we can rearrange the propagators 
themselves; this yields a counting factor of P\. All together, these count- 
ing factors neatly cancel the numbers from the dual Taylor expansions in 
eq. (9.11). 

However, this procedure generally results in an overcounting of the num- 
ber of terms that give identical results. This happens when some rearrange- 
ment of derivatives gives the same match-up to sources as some rearrange- 
ment of sources. This possibility is always connected to some symmetry 
property of the diagram, and so the factor by which we have overcounted 
is called the symmetry factor. The figures show the symmetry factor S of 
each diagram. 

Consider, for example, the second diagram of fig. (9.1). The three prop- 
agators can be rearranged in 3! ways, and all these rearrangements can 
be duplicated by exchanging the derivatives at the vertices. Furthermore 
the endpoints of each propagator can be simultaneously swapped, and the 
effect duplicated by swapping the two vertices. Thus, S = 2 x 3! = 12. 

Let us consider two more examples. In the first diagram of fig. (9.6), 
the exchange of the two external propagators (along with their attached 
sources) can be duplicated by exchanging all the derivatives at one vertex 
for those at the other, and simultaneously swapping the endpoints of each 
semicircular propagator. Also, the effect of swapping the top and bottom 
semicircular propagators can be duplicated by swapping the corresponding 
derivatives at each vertex. Thus, the symmetry factor is S = 2 x 2 = 4. 

In the diagram of fig. (9.10), we can exchange derivatives to match swaps 
of the top and bottom external propagators on the left, or the top and 
bottom external propagators on the right, or the set of external propagators 
on the left with the set of external propagators on the right. Thus, the 
symmetry factor isS = 2x2x2 = 8. 

The diagrams in figs. (9.1-9.11) are all connected: we can trace a path 
through the diagram between any two points on it. However, these are 
not the only contributions to Z{J). The most general diagram consists of 
a product of several connected diagrams. Let Ci stand for a particular 
connected diagram, including its symmetry factor. A general diagram D 
can then be expressed as 

D =^nvir , (9-i2) 

where n/ is an integer that counts the number of C/’s in D, and Sd is the 
additional symmetry factor for D (that is, the part of the symmetry factor 
that is not already accounted for by the symmetry factors already included 
in each of the connected diagrams). We now need to determine Sd- 
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S = 2 


Figure 9.3: All connected diagrams with E = 1 and V = 1. 


-CD KX) 



S = 2 2 


S = 2 2 


S = 2 3 


Figure 9.4: All connected diagrams with E = 1 and V = 3. 



S = 2 


Figure 9.5: All connected diagrams with E = 2 and V = 0. 



S = 2 2 



S = 2 2 


Figure 9.6: All connected diagrams with E = 2 and V = 2. 
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Since we have already accounted for propagator and vertex rearrange- 
ments within each Cj, we need to consider only exchanges of propagators 
and vertices among different connected diagrams. These can leave the total 
diagram D unchanged only if (1) the exchanges are made among different 
but identical connected diagrams, and only if (2) the exchanges involve all 
of the propagators and vertices in a given connected diagram. If there are 
n/ factors of Cj in D, there are njl ways to make these rearrangements. 
Overall, then, we have 

^ = IW- ( 9 - 13 ) 

i 

Now Z\ (J) is given (up to an overall normalization) by summing all dia- 
grams D, and each D is labeled by the integers nj. Therefore 

Z 1 (J) oc ^ D 
(ml 

<* 

/ n ‘- 

oo 

I m= 0 ' 11 ' 

oc Ilexp (C 1 /) 

i 

OC exp(E/C/) . (9.14) 

Thus we have a remarkable result: Z\(J) is given by the exponential of the 
sum of connected diagrams. This makes it easy to impose the normalization 
Zi(0) = 1: we simply omit the vacuum diagrams (those with no sources), 
like those of figs. (9.1) and (9.2). We then have 

Zi(J) =exp[iWi(J)] ,. (9.15) 

where we have defined 

iW\(J) =£<?/, (9.16) 

mo} 

and the notation I {0} means that the vacuum diagrams are omitted 
from the sum, so that ITj (0) = 0. 1 

Were it not for the counterterms in C\, we would have Z{J) = Z\ ( J). 
Let us see what we would get if this was, in fact, the case. In particular, let 
us compute the vacuum expectation value of the field which is given 

1 We have included a factor of i on the left-hand side of eq. (9.16) because then W\{J) 
is real in free-held theory; see problem 8.6. 
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S = 2 2 S = 2 2 

Figure 9.7: All connected diagrams with E = 2 and V = 4. 

S = 3! 

Figure 9.8: All connected diagrams with E = 3 and V = 1. 








9: The Path Integral for Interacting Field Theory 


78 



Figure 9.9: All connected diagrams with E = 3 and V = 3. 

K 

S = 2 3 


Figure 9.10: All connected diagrams with E = 4 and V = 2. 



Figure 9.11: All connected diagrams with E = 4 and V = 4. 
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• — x #-Q-x • — 

S = 1 S = 2 S = 2 



S = 2 


Figure 9.12: All connected diagrams with E = 1, X > 1 (where X is the 
number of one-point vertices from the linear counterterm), and V + X <3. 


by 


(0|<^(x)|0) 


1 5 

i 5J(x 

6 

SJ(x) 


Zi(J) 


j = o 


IF, (.7) 


j = o 


(9.17) 


This expression is then the sum of all diagrams [such as those in figs. (9.3) 
and (9.4)] that have a single source, with the source removed: 


(0|<p(x)|0) = \ig J d A y \A{x-y)\A(y-y) + 0(g 3 ) . (9.18) 


Here we have set Z g = 1 in the first term, since Z g = 1 + 0(g 2 ). We see the 
vacuum-expectation value of tp(x) is not zero, as is required for the validity 
of the LSZ formula. To fix this, we must introduce the counterterm Y<p. 
Including this term in the interaction lagrangian C\ introduces a new kind 
of vertex, one where a single line segment ends; the corresponding vertex 
factor is iY f d^y. The simplest diagrams including this new vertex are 
shown in fig. (9.12), with a cross symbolizing the vertex. 

Assuming Y = 0(g), only the first diagram in fig. (9.12) contributes at 
O(g), and we have 

(0|<y?(x)|0) = {iY + i(z 5 )iA(0)) J d 4 y \A(x-y) + 0(g 3 ) . (9.19) 

Thus, in order to have (0|<^(x)|0) = 0, we should choose 

Y = ^A(O) + 0(g 3 ) . (9.20) 


The factor of i is disturbing, because Y must be a real number: it is the 
coefficient of a hermitian operator in the hamiltonian, as seen in eq. (9.4). 
Therefore, A(0) must be purely imaginary, or we are in trouble. We have 


A(0) = 


d A k 1 
(27t) 4 k 2 + m 2 — ie 


(9.21) 
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From eq. (9.21), it is not immediately obvious whether or not A(0) is purely 
imaginary, but eq. (9.21) does reveal another problem: the integral diverges 
at large k. This is another example of an ultraviolet divergence, similar to 
the one we encountered in section 3 when we computed the zero-point 
energy of the field. 

To make some progress, we introduce an ultraviolet cutoff A, which we 
assume is much larger than m and any other energy of physical interest. 
Modifications to the propagator above some cutoff may be well justified 
physically; for example, quantum fluctuations in spacetime itself should 
become important above the Planck scale, which is given by the inverse 
square root of Newton’s constant, and has the numerical value of 10 19 GeV 
(compared to, say, the proton mass, which is lGeV). 

In order to retain the Lorentz-transformation properties of the propa- 
gator, we implement the ultraviolet cutoff in a more subtle way than we 
did in section 3; specfically, we make the replacement 


A(x 


d A k e ik I x -y) f A 2 \ 2 
(27t) 4 k 2 + m 2 — ie y k 2 + A 2 — ie J 


(9.22) 


The integral is now convergent, and we can evaluate the modified A(0) 
with the methods of section 14; for A 3> m, the result is 


A(0) 



(9.23) 


Thus Y is real, as required. If we like, we can now formally take the limit 
A — » oo. The parameter Y becomes infinite, but (0|<^(a;)|0) remains zero, 
at least to this order in g. 

It may be disturbing to have a parameter in the lagrangian that is 
formally infinite. However, such parameters are not directly measurable, 
and so need not obey our preconceptions about their magnitudes. Also, it 
is important to remember that Y includes a factor of g\ this means that we 
can expand in powers of Y as part of our general expansion in powers of g. 
When we compute something measurable (like a scattering cross section), 
all the formally infinite numbers will cancel in a well-defined way, leaving 
behind finite coefficients for the various powers of g. We will see how this 
works in detail in sections 14-20. 

As we go to higher orders in g, things become more complicated, but 
in principle the procedure is the same. Thus, at 0(g 3 ), we sum up the 
diagrams of figs. (9.4) and (9.12), and then add to Y whatever 0(g 3 ) term 
is needed to maintain (0|<^(a;)|0) = 0. In this way we can determine the 
value of Y order by order in powers of g. 

Once this is done, there is a remarkable simplification. Our adjustment 
of Y to keep (0 |<^(.t)| 0) = 0 means that the sum of all connected diagrams 
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•OO* #H Q h# 

V A A 



Figure 9.13: All connected diagrams without tadpoles with E < 4 and 
V < 4. 


with a single source is zero. Consider now that same infinite set of diagrams, 
but replace the single source in each of them with some other sub diagram. 
Here is the point: no matter what this replacement subdiagram is, the sum 
of all these diagrams is still zero. Therefore, we need not bother to compute 
any of them! The rule is this: ignore any diagram that, when a single line is 
cut, falls into two parts, one of which has no sources. All of these diagrams 
(known as tadpoles) are canceled by the Y counterterm, no matter what 
subdiagram they are attached to. The diagrams that remain (and need to 
be computed!) are shown in fig. (9.13). 

We turn next to the remaining two counterterms. For notational sim- 
plicity we define 

A = — 1 , 


B — Z m — 1 , 


(9.24) 
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and recall that we expect each of these to be 0{g 2 ). We now have 


Z(J) = exp 



crx 


i SJ(x) 


(—Ad 2 + Bm 2 ^j 


i SJ(x) 


Zi(J) • 


(9.25) 

We have integrated by parts to put both <9 x ’s onto one S/6J(x). (Note that 
the time derivatives in this interaction should really be treated by including 
an extra source term for the conjugate momentum II = <p. However, the 
space derivatives are correctly treated, and then the time derivatives must 
work out comparably by Lorentz invariance.) 

Eq. (9.25) results in a new vertex at which two lines meet. The corre- 
sponding vertex factor is (— i) J d 4 x {—Ad 2 + Bm 2 ); the d 2 acts on the x in 
one or the other (but not both) propagators. (Which one does not matter, 
and can be changed via integration by parts.) Diagramatically, all we need 
do is sprinkle these new vertices onto the propagators in our existing dia- 
grams. How many of these vertices we need to add depends on the order 
in g we are working to achieve. 

This completes our calculation of Z(J ) in 99 s theory. We express it as 


Z{J) = exp [iW (J)] , (9.26) 

where W ( J) is given by the sum of all connected diagrams with no tad- 
poles and at least two sources, and including the counterterm vertices just 
discussed. 

Now that we have Z(J ), we must find out what we can do with it. 


Problems 


9.1) Compute the symmetry factor for each diagram in fig. (9.13). (You 
can then check your answers by consulting the earlier figures.) 

9.2) Consider a real scalar field with £ = £0 + £ 1 , where 

£ 0 = -^d^ipd^ - \rri 2 g? , 

£1 = ~^gZ\\Lp 4 +£ c t > 

£ c t = -\{Z (fi -T)& l ipd l j,ip - \{Z m -l)m 2 ip 2 . 

a) What kind of vertex appears in the diagrams for this theory (that 
is, how many line segments does it join?), and what is the associated 
vertex factor? 

b) Ignoring the counterterms, draw all the connected diagrams with 
1 < E < 4 and 0 < V <2, and find their symmetry factors. 

c) Explain why we did not have to include a counterterm linear in 
to cancel tadpoles. 
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9.3) Consider a complex scalar field (see problems 3.5, 5.1, and 8.7) with 
C = Cq + Ci, where 


To = -d^^dpcp - mW , 

T\ = — tY + Tct , 

C ct = -(Z ¥ ,-l)d / V t 3 M <£ - (Z m -l)mW • 

This theory has two kinds of sources, J and J', and so we need a 
way to tell which is which when we draw the diagrams. Rather than 
labeling the source blobs with a J or J' , we will indicate which is 
which by putting an arrow on the attached propagator that points 
towards the source if it is a jt, and away from the source if it is a J. 

a) What kind of vertex appears in the diagrams for this theory, and 
what is the associated vertex factor? Hint: your answer should involve 
those arrows! 

b) Ignoring the counterterms, draw all the connected diagrams with 
1 < E < 4 and 0 < V < 2, and find their symmetry factors. Hint: 
the arrows are important! 


9.4) Consider the integral 


exp W ( 5 , J) 


1 


r+oo 


dx exp —hx 2 + t -gx 3 + Jx 


\/2tt J-c 


(9.27) 


This integral does not converge, but it can be used to generate a joint 
power series in g and J, 


OO OO 

W{g,J) = '£, £ Cy,Eg v J E ■ 


V= 0 E = 0 


(9.28) 


a) Show that 

Cv,E = > ( 9 - 29 ) 

I Sl 

where the sum is over all connected Feynman diagrams with E sources 
and V three-point vertices, and Sj is the symmetry factor for each 
diagram. 

b) Use eqs. (9.27) and (9.28) to compute Cy.E for V < 4 and E < 5. 
(This is most easily done with a symbolic manipulation program like 
Mathematica.) Verify that the symmetry factors given in figs. (9.1- 
9.11) satisfy the sum rule of eq. (9.29). 
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c) Now consider W(g, J+Y ), with Y fixed by the “no tadpole” con- 
dition 

4~t w (9, J+y ) = 0 • (9.30) 

oJ j = o 

Then write 

OO OO 

W(g, J+Y) = Y, E Cv,E 9 V J E • (9-31) 

V=0 E = 0 

Show that 

Cv.E = ~q~ ' (9.32) 

I Sl 

where the sum is over all connected Feynman diagrams with E sources 
and V three-point vertices and no tadpoles , and Sj is the symmetry 
factor for each diagram. 

d) Let Y = aig + a^g 3 + . . . , and use eq. (9.30) to determine ai and 
03. Compute Cy.E for V < 4 and E < 4. Verify that the symmetry 
factors for the diagrams in fig. (9.13) satisfy the sum rule of eq. (9.32). 

9.5) The interaction picture. In this problem, we will derive a formula for 
(0|T<£>(x n ) . . . </?(xi)|0) without using path integrals. Suppose we have 
a hamiltonian density TL = Ho + Hi, where Ho = ^Ll 2 + ^(V<^) 2 + 
\ m 2 ip 2 , and H\ is a function of n(x, 0) and ip(x, 0) and their spatial 
derivatives. (It should be chosen to preserve Lorentz invariance, but 
we will not be concerned with this issue.) We add a constant to H 
so that H\ 0) = 0. Let |0) be the ground state of Ho, with a constant 
added to Hq so that Ho\$) = 0. {H\ is then defined as H — Ho-) The 
Heisenberg-picture field is 

<p(x, t) = e iHt ip(x, 0)e~ iHt . (9.33) 

We now define the interaction-picture field 

V?j(x, t) = e lHot ip(x, 0)e~ zHot . (9.34) 

a) Show that +i(x) obeys the Klein-Gordon equation, and hence is a 
free field. 

b) Show that ip(x) = W (t)ipi(x)U(t), where U(t) = e lHot e~ lHt is 
unitary. 

c) Show that U (t) obeys the differential equation i£u(t) = Hj(t)U(t), 
where Hj(t) = e lHot H\e lHot is the interaction hamiltonian in the in- 
teraction picture, and the boundary condition 17(0) = 1. 
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d) If Hi is specified by a particular function of the Schrodinger-picture 
fields II(x, 0) and y>(x, 0), show that H/(t) is given by the same func- 
tion of the interaction-picture fields II/(x, t) and </3/(x, f). 

e) Show that, for t > 0, 


U{t) = T exp 



(9.35) 


obeys the differential equation and boundary condition of part (c). 
What is the comparable expression for t < 0? Hint: you may need to 
define a new ordering symbol. 

f) Define U(t 2 ,t\) = U(t 2 )U^{t\). Show that, for t 2 > t\, 


U(t 2 ,h) = Texp 



(9.36) 


What is the comparable expression for t\ > 

g) For any time ordering, show that U{tz,t\) = U (t^,t, 2 )U {t 2l ti) and 
that U\ti,t 2 ) = U(t 2 ,ti). 

h) Show that 

tp(x n ) . . . tp(x i) = U ] (t n ,0)^ I (x n )U(t n ,t n -i)(p I (x n - 1 ) 

. . . U(t2,ti)ipi(xi)U(ti,0) . (9.37) 


i) Show that U\t n , 0) = U\oo, 0)U(oo, t n ) and also that U(ti,0) = 
U(t\, — oo)U(— oo, 0). 

j) Replace Hq with (1— ie)Ho, and show that (0|t/t(oo,0) = (O|0)(0| 
and that U(— oo,0)|0) = |0)(0|O). 

k) Show that 

(0| (p(x n ) ■ ■ • </?Oi)|0) = (0|t/(oo,t n )(/?/(.T n )t/(t„,t n _i)^/(x n _i) . . . 

U(t 2 ,ti)vi{xi)U(ti , — oo)|0) 
x |(0|O)| 2 . (9.38) 


1) Show that 

(0|T^(x n )...^(x 1 )|0) = (0|T V /(*n)...¥’/(* 1 )e“^ <rteWi(x) |0) 

x |(0|O)| 2 . (9.39) 


m) Show that 


|(0|O)| 2 = l/{%\Te- l $ dAxHl{x) \®) . 


(9.40) 
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Thus we have 


(0|T<p(aj n ) . . . <^(xi)|0) 


mT i p I (x n )...^ I (x 1 )e- i I d4x ' Hl ^m 

(0|Te _i / d4:rW/(a:) |0) 


(9.41) 

We can now Taylor expand the exponentials on the right-hand side 
of eq. (9.41), and use free-held theory to compute the resulting corre- 
lation functions. 
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10 Scattering Amplitudes and the Feynman 

Rules 

Prerequisite: 5, 9 


Now that we have an expression for Z(J) = exp iW(J), we can take func- 
tional derivatives to compute vacuum expectation values of time-ordered 
products of fields. Consider the case of two fields; we define the exact 
propagator via 

iA(xi - x 2 ) = ( 0 |T<^(xi)<Ax 2 )| 0 ) • (10.1) 

For notational simplicity let us define 


, I s 

^ i SJ(xj) 


(10.2) 


Then we have 


(0|T<p(xi)<p(x 2 )|0) 



=0 



8\8 2 iW{J) 



J = 0 

8\8 2 iW{J) 

J=0 ' 


(10.3) 


To get the last line we used 8jW(J)\j=o = (0|</?(xj)|0) = 0. Diagramat- 
ically, <5i removes a source, and labels the propagator endpoint xi. Thus 
AA(xi— x 2 ) is given by the sum of diagrams with two sources, with those 
sources removed and the endpoints labeled xq and x 2 . (The labels must be 
applied in both ways. If the diagram was originally symmetric on exchange 
of the two sources, the associated symmetry factor of 2 is then canceled by 
the double labeling.) At lowest order, the only contribution is the “barbell” 
diagram of fig. (9.5) with the sources removed. Thus we recover the obvious 
fact that AA(x’i— x 2 ) = iA(xi— x 2 ) + 0{g 2 ). We will take up the subject 
of the 0(g 2 ) corrections in section 14. 

For now, let us go on to compute 


(0|Tv9(xi)<Ax 2 Mx 3 )<y9(x4)|0) = 8l8 2 8 3 8, [Z (J) 


SMStiW 


+ (5i5 2 iW)(5z5iiW) 
+ (<5i<W)F2<W) 
+ FkMIUX-WIA) 


(10.4) 


We have dropped terms that contain a factor of (0|y>(x)|0) = 0. According 
to eq. (10.3), the last three terms in eq. (10.4) simply give products of the 
exact propagators. 
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Let us see what happens when these terms are inserted into the LSZ 
formula for two incoming and two outgoing particles, 

(f\i) = i 4 J Ai d 4 x 2 d% d% 

x(— d 2 + m 2 )(— d 2 + m 2 )(— <9 2 +m 2 )(— d 2 , +m 2 ) 
x{0\T<p(xi)g>(x2)y(xi)g>(x' 2 )\0) . (10.5) 

If we consider, for example, A A(aq — x'^j A.(x 2 — x 2 ) as one term in the 
correlation function in eq. (10.5), we get from this term 

J d 4 x i d\ 2 d% d% e^ixi+^-h^ x i-W2)F(x ir )F(x 22 ') 

= (27T) 4 5\k 1 -k , 1 ) (2ir) 4 5\k 2 -k’ 2 ) F(kw) F(k 22 , ) , (10.6) 

where F(xij) = (— df+m 2 ){— &j+m 2 ) A(xy), F(k) is its Fourier transform, 
Xiji = Xi—x'j, and k t j> = (ki+kj)/ 2. The important point is the two delta 
functions: these tell us that the four-momenta of the two outgoing particles 
(T and 2 r ) are equal to the four- momenta of the two incoming particles 
(1 and 2). In other words, no scattering has occurred. This is not the 
event whose probability we wish to compute! The other two similar terms 
in eq. (10.4) either contribute to “no scattering” events, or vanish due to 
factors like 6 4 (k\+k 2 ) (which is zero because k®+k 2 > 2m > 0). In general, 
the diagrams that contribute to the scattering process of interest are only 
those that are fully connected: every endpoint can be reached from every 
other endpoint by tracing through the diagram. These are the diagrams 
that arise from all the 5' s acting on a single factor of W . Therefore, from 
here on, we restrict our attention to those diagrams alone. We define the 
connected correlation functions via 

<0|T^(xi) . . . <p(x E ) |0>c = Sx . . . S E iW(J)\ j=o , (10.7) 

and use these instead of (0|Ty>(aq) . . . (p(x E ) |0) in the LSZ formula. 
Returning to eq. (10.4), we have 

{0\Fg 2 (x 1 )^(x 2 )^(x' 1 )^(x' 2 )\0)c = 8i5 2 5r5 2 nW . (10.8) 

The lowest-order (in g ) nonzero contribution to this comes from the diagram 
of fig. (9.10), which has four sources and two vertices. The four <5’s remove 
the four sources; there are 4! ways of matching up the 8’ s to the sources. 
These 24 diagrams can then be collected into 3 groups of 8 diagrams each; 
the 8 diagrams in each group are identical. The 3 distinct diagrams are 
shown in fig. (10.1). Note that the factor of 8 neatly cancels the symmetry 
factor S = 8 of the diagram with sources. 
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Figure 10.1: The three tree-level Feynman diagrams that contribute to the 
connected correlation function {0\Fy>(x\)y>(x 2 )y>{x' l )ip(x' 2 )\t))c- 


This is a general result for tree diagrams (those with no closed loops): 
once the sources have been stripped off and the endpoints labeled, each 
diagram with a distinct endpoint labeling has an overall symmetry factor 
of one. The tree diagrams for a given process represent the lowest-order (in 
g ) nonzero contribution to that process. 

We now have 

{0\Tg}{xi)g}{x 2 )^(x , 1 )g>(x 2 )\0) c 

= {i9? (} ) 5 Jd 4 yd 4 zA(y-z) 

x A(xi-y)A(x 2 -y)A(x' 1 -z)A(x 2 -z) 

+ A(x 1 -y)A(x , 1 -y)A(x 2 -z)A(x , 2 -z) 

+ A(xi-y)A(x' 2 -y)A(x 2 -z)A(x' 1 -z) 

+ 0(g 4 ). (10.9) 

Next, we use eq. (10.9) in the LSZ formula, eq. (10.5). Each Klein-Gordon 
wave operator acts on a propagator to give 

(-df + m 2 )A(xi - y) = <5 4 (x; - y) . (10.10) 

The integrals over the external spacetime labels 2 ,i', 2 ' are then trivial, 
and we get 

</|*> = (^) 2 (i) / d 4 yd 4 zA(y-z)[e i ^y +k *y- k 'F- k ^ 

_|_ e i{k\y+k2Z-k' 1 y-k' 2 z) 

+ e dkiy+k 2 Z-k[z-k' 2 y) 1 + Q( g 4-j _ ( 10-11 ) 


This can be simplified by substituting 


A(y 


d A k 

(27 r) 4 k 2 + m 2 — ie 


(10.12) 
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into eq. (10.9). Then the spacetime arguments appear only in phase factors, 
and we can integrate them to get delta functions: 

W “ %9 J (2^ k 2 + m 2 - ie 

x ( 2Tr) 4 6 4 (ki+k2+k ) (27t) 4 5 4 (k[+k 2 +k) 

+ (2ir) 4 5 4 (ki—k[+k) (2ir) 4 5 4 (k 2 — k 2 +k) 

+ (2v x) 4 6 4 {k l -k' 2 +k) {2ir) 4 8 4 (k[-k2+k) ] + 0{g 4 ) 

= ig 2 (27t) 4 5 4 (A:i+A: 2— fei— /C 2 ) 

1 1 1 

(Aq+fo) 2 + w 2 (Aq — A^) 2 + m 2 (k\ — k' 2 ) 2 + m 2 

+ 0(/). (10.13) 

In eq. (10.13), we have left out the id’s for notational convenience only; m 2 
is really m 2 — ie. The overall delta function in eq. (10.13) tells that that 
four-momentum is conserved in the scattering process, which we should, of 
course, expect. For a general scattering process, it is then convenient to 
define a scattering matrix element T via 

</|*> = (27r) 4 S 4 (k in -k out )iT , (10.14) 

where fc; n and fc ou t are the total four-momenta of the incoming and outgoing 
particles, respectively. 

Examining the calculation which led to eq. (10.13), we can take away 
some universal features that lead to a simple set of Feynman rules for 
computing contributions to iT for a given scattering process. The Feynman 
rules are: 

1. Draw lines (called external lines ) for each incoming and each outgoing 
particle. 

2. Leave one end of each external line free, and attach the other to a 
vertex at which exactly three lines meet. Include extra internal lines 
in order to do this. In this way, draw all possible diagrams that are 
topologically inequivalent. 

3. On each incoming line, draw an arrow pointing towards the vertex. 
On each outgoing line, draw an arrow pointing away from the vertex. 
On each internal line, draw an arrow with an arbitrary direction. 

4. Assign each line its own four-momentum. The four-momentum of 
an external line should be the four-momentum of the corresponding 
particle. 
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Figure 10.2: The tree-level s-, t-. and u-channel diagrams contributing to 
iT for two particle scattering. 

5. Think of the four-momenta as flowing along the arrows, and conserve 
four-momentum at each vertex. For a tree diagram, this fixes the 
momenta on all the internal lines. 

6. The value of a diagram consists of the following factors: 
for each external line, 1; 

for each internal line with momentum k, — i/(k 2 + m 2 — ie); 
for each vertex, iZ g g. 

7. A diagram with L closed loops will have L internal momenta that are 
not fixed by rule #5. Integrate over each of these momenta £i with 
measure d 4 ^/(27r) 4 . 

8. A loop diagram may have some leftover symmetry factors if there are 
exchanges of internal propagators and vertices that leave the diagram 
unchanged; in this case, divide the value of the diagram by the sym- 
metry factor associated with exchanges of internal propagators and 
vertices. 

9. Include diagrams with the counterterm vertex that connects two prop- 
agators, each with the same four-momentum k. The value of this 
vertex is — i(Ak 2 + Bm 2 ), where A = Z v — 1 and B = Z m — 1, and 
each is 0(g 2 ). 

10. The value of iT is given by a sum over the values of all these diagrams. 

For the two-particle scattering process, the tree diagrams resulting from 
these rules are shown in fig. (10.2). 

Now that we have our procedure for computing the scattering amplitude 
T , we must see how to relate it to a measurable cross section. 


Problems 
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10.1) Use eq. (9.41) of problem 9.5 to rederive eq. (10.9). 

10.2) Write down the Feynman rules for the complex scalar field of prob- 
lem 9.3. Remember that there are two kinds of particles now (which 
we can think of as positively and negatively charged), and that your 
rules must have a way of distinguishing them. Hint: the most direct 
approach requires two kinds of arrows: momentum arrows (as dis- 
cussed in this section) and what we might call “charge” arrows (as 
discussed in problem 9.3). Try to find a more elegant approach that 
requires only one kind of arrow. 

10.3) Consider a complex scalar field ip that interacts with a real scalar 
field x v i a £i = gxp'P- Use a solid line for the ip propagator and 
a dashed line for the x propagator. Draw the vertex (remember the 
arrows!), and find the associated vertex factor. 

10.4) Consider a real scalar field with C\ = \gpd^pd^p. Find the associ- 
ated vertex factor. 

10.5) The scattering amplitudes should be unchanged if we make a field 
redefinition. Suppose, for example, we have 

C = —\d^pd^(p — \rr?ip 2 , (10.15) 

and we make the field redefinition 

ip — > ip + \p 2 . (10.16) 

Work out the lagrangian in terms of the redefined field, and the cor- 
responding Feynman rules. Compute (at tree level) the <pip — > pip 
scattering amplitude. You should get zero, because this is a free-held 
theory in disguise. (At the loop level, we also have to take into ac- 
count the transformation of the functional measure Flip; see section 
85.) 
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11 Cross Sections and Decay Rates 

Prerequisite: 10 


Now that we have a method for computing the scattering amplitude T. we 
must convert it into something that could be measured in an experiment. 

In practice, we are almost always concerned with one of two generic 
cases: one incoming particle, for which we compute a decay rate, or two 
incoming particles, for which we compute a cross section. We begin with 
the latter. 

Let us also specialize, for now, to the case of two outgoing particles as 
well as two incoming particles. In tp 3 theory, we found in section 10 that 
in this case we have 


+ 0(g 4 ) , 

(H-1) 

where k\ and k 2 are the four-momenta of the two incoming particles, k[ and 
k 2 are the four- momenta of the two outgoing particles, and k±+k 2 = k[+k 2 . 
Also, these particles are all on shell: k 2 = —mf. (Here, for later use, we 
allow for the possibility that the particles all have different masses.) 

Let us think about the kinematics of this process. In the center- of - 
mass frame, or CM frame for short, we take ky + k 2 = 0, and choose ki 
to be in the +z direction. Now the only variable left to specify about the 
initial state is the magnitude of ki. Equivalently, we could specify the total 
energy in the CM frame, E\ + E- 2 . However, it is even more convenient to 
define a Lorentz scalar s = — (k\ + k 2 ) 2 ■ In the CM frame, s reduces to 
(Ei + E- 2 ) 2 \ s is therefore called the center- of -mass energy squared. Then, 
since E\ = (k 2 + m 2 ) 1 / 2 and E 2 = (k 2 + m I) 1 / 2 , we can solve for |ki| in 
terms of s, with the result 


T = g 2 


1 


+ 


1 


+ 


1 


(ki+ko) 2 + m 2 (k\— k\) 2 + m 2 (ki—k' 0 ) 2 + m 


ki 



(CM frame) . 


( 11 . 2 ) 


Now consider the two outgoing particles. Since momentum is conserved, 
we must have k) + k' 2 = 0, and since energy is conserved, we must also 
have (E( + E 2 ) 2 = s. Then we find 


|ki| 



(CM frame) . 


(11.3) 


Now the only variable left to specify about the final state is the angle 9 
between ki and k) . However, it is often more convenient to work with the 
Lorentz scalar t = —(k\ — k() 2 , which is related to 9 by 


t. = m\ +m\i — 2E\E{ + 21^11^1 cos 0 . 


(11.4) 
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This formula is valid in any frame. 

The Lorentz scalars s and t are two of the three Mandelstam variables, 
defined as 


s = — (Aq+A^) 2 — — (A^+A: 2 ) 2 , 
t = -(Aq -k[) 2 = ~(k 2 -k 2 ) 2 , 
u = — (Aq — k' 2 ) 2 = —{k 2 —k' 1 ) 2 . (11-5) 


The three Mandelstam variables are not independent; they satisfy the linear 
relation 


s + t + u = m\ + m 2 + m 2 i + m 2 i 


In terms of s, t, and u, we can rewrite eq. (11.1) as 

1 1 1 


T = </ 


rri z — s rri z — t 


+ 


m - — u 


+ 0(g 4 ) , 


( 11 . 6 ) 


(11.7) 


which demonstrates the notational utility of the Mandelstam variables. 

Now let us consider a different frame, the fixed target or FT frame (also 
sometimes called the lab frame), in which particle #2 is initially at rest: 
k 2 = 0 . In this case we have 

|ki| = s 2 — 2 (to? + ?B 9 )s + (m? — m 2 ) 2 (FT frame) . (11.8) 

2ni2 v 

Note that, from eqs. (11.8) and (11.2), 


7^2 1 ki | FT — \/s|ki|cM • 


(11.9) 


This will be useful later. 

We would now like to derive a formula for the differential scattering 
cross section. In order to do so, we assume that the whole experiment is 
taking place in a big box of volume V , and lasts for a large time T. We 
should really think about wave packets coming together, but we will use 
some simple shortcuts instead. Also, to get a more general answer, we will 
let the number of outgoing particles be arbitrary. 

Recall from section 10 that the overlap between the initial and final 
states is given by 

</|z> = (2ir) 4 5 4 (h m -k out )iT . (11.10) 

To get a probability, we must square (f\i), and divide by the norms of the 
initial and final states: 

\(f\i)\ 2 
(f\f)(i\i) ' 


P = 


( 11 . 11 ) 
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The numerator of this expression is 



\(f\i)\ 2 = [(2ir) 4 5\k in -k out )} 2 \T\ 2 . 

(11.12) 

We write the square of the delta function as 


[(27T ) 4 6 4 (k in 

-fcout)] 2 = ( 27r) 4 d 4 ( A: in — k out ) x (27r) 4 <5 4 (0) , 

(11.13) 

and note that 

(2tt) 4 <5 4 (0) = J d A x e i0 ' x = VT . 

(11.14) 

Also, the norm of 

a single particle state is given by 



(k\k) = (2ir) 3 2k°5 3 {0) = 2 k°V . 

(11.15) 

Thus we have 

(i\i) = AE x E 2 V 2 , 
/ 

(11.16) 


O 

CM 

ii 

(11.17) 


3 = i 


where n' is the number of outgoing particles. 

If we now divide eq. (11.11) by the elapsed time T, we get a probability 
per unit time 


(27r) 4 (5 4 (fc in — fc out ) V \T\ 2 
AE X E 2 V 2 TI-=i 2/c'°l/ 


This is the probability per unit time to scatter into a set of outgoing par- 
ticles with precise momenta. To get something measurable, we should sum 
each outgoing three-momentum k'- over some small range. Due to the box, 
all three-momenta are quantized: k( = (27r/L)nl, where V = L 3 , and nl is 
a three- vector with integer entries. (Here we have assumed periodic bound- 
ary conditions, but this choice does not affect the final result.) In the limit 
of large L, we have 


E 


V 


(27 r) 3 


d% . 


(11.19) 


3 


Thus we should multiply P by a factor of V d 3 kj / (2n) 3 for each outgoing 

particle. Then we get 


p _ (27r) 4 (i 4 (fci n -fc ou t) | , 2 -^1- vt/ 

.!/•;, /; 2 t 1 j : 


AEiE 2 V 


(11.20) 
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where we have identified the Lorentz-invariant phase-space differential 


dk = 


d 3 k 


(2 tt) 3 2 k° 


( 11 . 21 ) 


that we first introduced in section 3. 

To convert P to a differential cross section do, we must divide by the 
incident flux. Let us see how this works in the FT frame, where particle 
#2 is at rest. The incident flux is the number of particles per unit volume 
that are striking the target particle (#2), times their speed. We have one 
incident particle (#1) in a volume V with speed v = |ki|/£i, and so the 
incident flux is \\l\\/EiV . Dividing eq. (11.20) by this flux cancels the last 
factor of V, and replaces E\ in the denominator with |ki|. We also set 
E 2 = fn 2 and note that eq. ( 11 . 8 ) gives |ki|m .2 as a function of s; do will 
be Lorentz invariant if, in other frames, we simply use this function as the 
value of |ki|m 2 . Adopting this convention, and using eq. (11.9), we have 


do = 


1 


4|ki| CM y / S 


\T\ 2 dUPS n ,(k 1 +k 2 ) , 


(11.22) 


where |ki| CM is given as a function of s by eq. ( 11 . 2 ), and we have defined 
the rL-body Lorentz-invariant phase-space measure 


dLIPS n /(A;) = (27r) 4 <f 4 (fc-£"=i k[) [] dk) . (11.23) 

i=i 

Eq. (11.22) is our final result for the differential cross section for the scat- 
tering of two incoming particles into n' outgoing particles. 

Let us now specialize to the case of two outgoing particles. We need to 
evaluate 

dLIPS 2 (A0 = (2ir) 4 5 4 (k-k[-k' 2 ) dk[dk' 2 , (11.24) 

where k = k\ + k 2 . Since dLIPS 2 (fc) is Lorentz invariant, we can compute 
it in any convenient frame. Let us work in the CM frame, where k = 
ki + k 2 = 0 and k° = E\ + E 2 = ^/s; then we have 

dLIPS 2 (£0 = 4 ^ 2 ^, KK EE^-yC ) (5 3 (k'| +k' ) <l%d:% . (11.25) 

We can use the spatial part of the delta function to integrate over d 3 k 2 , 
with the result 

dUPS 2 (k) = 5(E[+E' 2 -V~s) d% , (11.26) 
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where now 


E\ = \J'k! l 2 + m\, and E’ 2 = ^/kj 2 + m\, . 

(11.27) 

Next, let us write 


d% = \k[\ 2 d|ki| dD CM , 

(11.28) 


where dfi CM = sin 9 d6 d(f> is the differential solid angle, and 9 is the angle 
between ki and k) in the CM frame. We can carry out the integral over the 
magnitude of kj in eq. (11.26) using / dx6(f(x )) = J2i l/ / (* T *)l~ 1 ) where xi 
satisfies f(xi) = 0. In our case, the argument of the delta function vanishes 
at just one value of |kj_| , the value given by eq. (11.3). Also, the derivative 
of that argument with respect to (kj is 



Putting all of this together, we get 


E[ + E> 2 


l k 'il 


( E'i + E'A 
V E[E’ 2 ) 


k il \fs 
E\E! 2 ' 


dLIPS2(A:) — — — 2 ^ r- dD GM . 

167 T z y/S 

Combining this with eq. (11.22), we have 

d°_ 1 l k ll | T |2 

dD cm 647 r 2 .s |ki| 


(11.29) 


(11.30) 


(11.31) 


where |ki| and |k( | are the functions of s given by eqs. (11.2) and (11.3), 
and dn C M is the differential solid angle in the CM frame. 

The differential cross section can also be expressed in a frame-independent 
manner by noting that, in the CM frame, we can take the differential of 
eq. (11.4) at fixed s to get 


dt = 2 |ki| |k)| d cos 9 


= 2 M ! k i 


dD, cm 
2t r 


Now we can rewrite eq. (11.31) as 

da 1 

dt 647rs|ki| 2 



(11.32) 

(11.33) 


(11.34) 
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where |ki| is given as a function of s by eq. (11.2). 

We can now transform do/dt into do/dD, in any frame we might like 
(such as the FT frame) by taking the differential of eq. (11.4) in that frame. 
In general, though, (k) | depends on 6 as well as s, so the result is more 
complicated than it is in eq. (11.32) for the CM frame. 

Returning to the general case of n! outgoing particles, we can define a 
Lorentz invariant total cross section by integrating completely over all the 
outgoing momenta, and dividing by an appropriate symmetry factor S. If 
there are n' identical outgoing particles of type i, then 


S = (11.35) 

i 

and 

o = ^ J do , (11.36) 

where do is given by eq. (11.22). We need the symmetry factor because 
merely integrating over all the outgoing momenta in dLIPS n / treats the 
final state as being labeled by an ordered list of these momenta. But if 
some outgoing particles are identical, this is not correct; the momenta of 
the identical particles should be specified by an unordered list (because, for 
example, the state aja^jO) is identical to the state <4 a i|0))- The symmetry 
factor provides the appropriate correction. 

In the case of two outgoing particles, eq. (11.36) becomes 


o 


1 

S 


dQ 


do 

dQ 


CM 


2t r 

~S 



i a da 
d cos 9 — 

di 2cm 


(11.37) 

(11.38) 


where S = 2 if the two outgoing particles are identical, and S = 1 if they 
are distinguishable. Equivalently, we can compute o from eq. (11.34) via 

1 rtmax rj/j 

o = - dt -, (11.39) 

S m dt 

where t m \ n and t max are given by eq. (11.4) in the CM frame with cos 9 = — 1 
and +1, respectively. To compute o with eq. (11.38), we should first express 
t and u in terms of s and 9 via eqs. (11.4) and (11.6), and then integrate 
over 9 at fixed s. To compute o with eq. (11.39), we should first express u 
in terms of s and t via eq. (11.6), and then integrate over t at fixed s. 

Let us see how all this works for the scattering amplitude of theory, 
eq. (11.7). In this case, all the masses are equal, and so, in the CM frame, 
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E = -I; \fs for all four particles, and | k^| = |ki| = ^(s — 4 m 2 ) 1 / 2 . Then 
eq. (11.4) becomes 

t = — |(s — 4m 2 )(l — cos 9) . (11.40) 

From eq. (11.6), we also have 

u = — ^(s — 4m 2 )(l + cos0) . (11.41) 

Thus |T| 2 is quite a complicated function of s and 9. In the nonrelativistic 
limit, |ki| <C m or equivalently s — 4m 2 <C m 2 , we have 


T = 


5 g 2 
3 m 2 


8 f s — 4 m 2 \ 

15 y m 2 J 



27 

+ 25 008 



s — 4m 2 


m- 


2 

+ ... 


+ <V). 


(11.42) 


Thus the differential cross section is nearly isotropic. In the extreme rela- 
tivistic limit, |ki| 3> m or equivalently s i$> m 2 , we have 


T = 



3 + cos 2 6 


/ (3 + cos 2 9) 2 
y sin 2 9 



o 

m 

s 


+ 0 ( 5 4 ) • 


(11.43) 


Now the differential cross section is sharply peaked in the forward (9 = 0) 
and backward ( 9 = n) directions. 

We can compute the total cross section cr from eq. (11.39). We have in 
this case j n = — (s — 4m 2 ) and t max = 0. Since the two outgoing particles 
are identical, the symmetry factor is S = 2. Then setting u = 4 m 2 — s — t, 
and performing the integral in eq. (11.39) over t at fixed s, we get 


(7 = 


a 


32ns(s — 4?n 2 ) 

+ 


2 s — 4 m 2 
— ^ + 


m 2 ( s — m 2 ) 2 s — 3 m 2 
s — 3m 2 \ 


4m 2 


( s — m 2 )(s — 2m 2 ) 
In the nonrelativistic limit, this becomes 
25 g A 


In 


rrr 


+ 0(g 6 ). (11.44) 


(7 = 


11527rm 6 


1 79 j s — 4 to 2 \ ^ 


60 


m- 


In the extreme relativistic limit, we get 


( 7 = 


9 


167 T77l 2 S 2 


7 m 2 

1 + 2 V + -” 


+ 0(g 6 ). 


+ 0(g 6 ). 


(11.45) 


(11.46) 
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These results illustrate how even a very simple quantum field theory can 
yield specific predictions for cross sections that could be tested experimen- 
tally. 

Let us now turn to the other basic problem mentioned at the beginning 
of this section: the case of a single incoming particle that decays to n' other 
particles. 

We have an immediate conceptual problem. According to our develop- 
ment of the LSZ formula in section 5, each incoming and outgoing particle 
should correspond to a single-particle state that is an exact eigenstate of 
the exact hamiltonian. This is clearly not the case for a particle that can 
decay. Referring to fig. (5.1), the hyperbola of such a particle must he above 
the continuum threshold. Strictly speaking, then, the LSZ formula is not 
applicable. 

A proper understanding of this issue requires a study of loop corrections 
that we will undertake in section 25. For now, we will simply assume that 
the LSZ formula continues to hold for a single incoming particle. Then we 
can retrace the steps from eq. (11.11) to eq. (11.20); the only change is that 
the norm of the initial state is now 

(i\i) = 2E 1 V (11.47) 

instead of eq. (11.16). Identifying the differential decay rate dT with P then 
gives 

dT = -j- |T| 2 dLIPS n /(A: 1 ) , (11.48) 

where now s = —k\ = rnj . In the CM frame (which is now the rest frame of 
the initial particle), we have Ei = mi; in other frames, the relative factor 
of E\ /mi in dT accounts for relativistic time dilation of the decay rate. 

We can also define a total decay rate by integrating over all the outgoing 
momenta, and dividing by the symmetry factor of eq. (11.35): 

r = |/'dT. (11.49) 

We will compute a decay rate in problem 11.1 

Reference Notes 

For a derivation with wave packets, see Brown, Rzykson & Zuber, or Peskin 
& Schroeder. 


Problems 
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11.1) a) Consider a theory of a two real scalar fields A and B with an 
interaction C\ = gAB 2 . Assuming that rn A > 2 m B , compute the 
total decay rate of the A particle at tree level. 

b) Consider a theory of a real scalar field cp and a complex scalar field 
X with C\ = gpx*X- Assuming that rn v > 2 m x , compute the total 
decay rate of the ip particle at tree level. 


11 . 2 ) Consider Compton scattering , in which a massless photon is scattered 
by an electron, initially at rest. (This is the FT frame.) In problem 
59.1, we will compute \T\ 2 for this process (summed over the possible 
spin states of the scattered photon and electron, and averaged over 
the possible spin states of the initial photon and electron), with the 
result 


ITI 2 = 32ttV 


m 4 + m 2 (3s + u) — su m 4 + m 2 (3u + s) — su 


+ 


(m 2 — s ) 2 

2 m 2 (s + u + 2 m 2 ) 
(m 2 — s)(m 2 — u ) 


(m 2 — u ) 2 
+ 0(a 4 ) (11.50) 


where a = 1/137.036 is the fine-structure constant. 

a) Express the Mandelstam variables s and u in terms of the initial 
and final photon energies lo and lo' . 

b) Express the scattering angle 0 FT between the initial and final pho- 
ton three- momenta in terms of to and lo' . 


c) Express the differential scattering cross section do /dPl FT in terms 
of lo and lo' . Show that your result is equivalent to the Klein- Nishina 
formula 


do 

dCl FT 


a 2 lo ' 2 


2 m 2 


LO 


LO 


LO 


— H sim 


LO 


LO 


(11.51) 


11.3) Consider the process of muon decay , — > e“F e r' A1 . In section 88 , 

we will compute |T | 2 for this process (summed over the possible spin 
states of the decay products, and averaged over the possible spin 
states of the initial muon), with the result 

|T | 2 = 64 Glikx-k'^ik'i-k'z) , (11.52) 


where Gp is the Fermi constant, k± is the four-momentum of the 
muon, and k'i 23 are the four-momenta of the v e , and e ~ , respec- 
tively. In the rest frame of the muon, its decay rate is therefore 


r = 


32 Gl 
m 


■k' 2 )(k' v k'z) dLlPS 3 (h) , 


(11.53) 
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where k± = (m, 0), and m is the muon mass. The neutrinos are 
massless, and the electron mass is 200 times less than the muon mass, 
so we can take the electron to be massless as well. To evaluate T, we 
perform the following analysis. 

a) Show that 

T = — £ / dk' 3 k {lJ k' iu / kf l k\ l/ dUPS 2 (k 1 -k' 3 ) . (11.54) 

b) Use Lorentz invariance to argue that, for my = m 2 ' = 0, 

J k'/'k'S dUPS 2 (k) = Ak 2 g^ + Bk^k u , (11.55) 

where A and B are numerical constants. 

c) Show that, for my = m 2 i = 0, 

[ dLlPSAk) = — . (11.56) 

J 8tt 

d) By contracting both sides of eq. (11.55) with g IJV and with k^k v , 
and using eq. (11.56), evaluate A and B. 

e) Use the results of parts (b) and (d) in eq. (11.54). Set k\ = (?n, 0 ), 
and compute dT/dE e \ here E e = E 3 is the energy of the electron. 
Note that the maximum value of E e is reached when the electron 
is emitted in one direction, and the two neutrinos in the opposite 
direction; what is this maximum value? 

f) Perform the integral over E e to obtain the muon decay rate T. 

g) The measured lifetime of the muon is 2.197 x 10~ 6 s. The muon 
mass is 105.66 MeV. Determine the value of Gy in GeV~ 2 . (Your 
answer is too low by about 0.2%, due to loop corrections to the decay 
rate.) 

h) Define the energy spectrum of the electron P(E e ) = T~ 1 dT / dE e . 
Note that P(E e )dE e is the probability for the electron to be emit- 
ted with energy between E e and E e + dE e . Draw a graph of P{E e ) 
vs. Ue/^V 

11.4) Consider a theory of three real scalar fields ( A , B, and C ) with 

C = — ^d^Ad^A — \m 2 A A 2 

- \d^B8^B - \m- B B 2 

- \d^Cd^C - \m 2 c C 2 
+ gABC . 


(11.57) 



11: Cross Sections and Decay Rates 


103 


Write down the tree-level scattering amplitude (given by the sum of 
the contributing tree diagrams) for each of the following processes: 

AA — > AA , 

AA — > AB , 

AA — > BB , 

AA-+ BC , 

AB — ► AB , 

AB ^ AC . (11.58) 


Your answers should take the form 


T = g 2 


+ 


mt — s 


c t 

ml — t 


+ 




mf ( — u 


(11.59) 


where, in each case, each c t is a positive integer, and each rrif is m 2 A 
or m\ or rri 2 c . Hint: T may be zero for some processes. 
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12 Dimensional Analysis with h = c = 1 

Prerequisite: 3 


We have set U = c = 1. This allows us to convert a time T to a length L 
via T = c~ 1 L 1 and a length L to an inverse mass M -1 via L = 

Thus any quantity A can be thought of as having units of mass to some 
power (positive, negative, or zero) that we will call [A]. For example, 


[m] = +1 , (12A) 

W = -1 , ( 12 - 2 ) 

[&*] = +1 , (12.3) 

[d d x\ = -d . (12.4) 


In the last line, we have generalized our considerations to theories in d 
spacetime dimensions. 

Let us now consider a scalar held in d spacetime dimensions with la- 
grangian density 

N 

C = h.9n^ n ■ (12.5) 

n = 3 


The action is 

S = I d d xC , 

and the path integral is 


Z(J) 


J Vip exp 


d d x (C + Jtp) 


( 12 . 6 ) 


(12.7) 


From eq. (12.7), we see that the action S must be dimensionless, because 
it appears as the argument of the exponential function. Therefore 


[S] = 0 . (12.8) 

Combining eqs. (12.4) and (12.8) yields 

[C]=d . (12.9) 

Then, from eqs. (12.9) and (12.3), and the fact that d^cpd^cp is a term in C, 
we see that we must have 


W\ = W- 2 ). ( 12 . 10 ) 

Then, since g n ^p n is also a term in £, we must have 

[ g n \ = d - \n(d - 2) . 


( 12 . 11 ) 
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In particular, for the <p 3 theory we have been working with, we have 

[g 3 ] = l(6-d). (12.12) 

Thus we see that the coupling constant of (p 3 theory is dimensionless in 
d = 6 spacetime dimensions. 

Theories with dimensionless couplings tend to be more interesting than 
theories with dimensionful couplings. This is because any nontrivial de- 
pendence of a scattering amplitude on a coupling must be expressed as a 
function of a dimensionless parameter. If the coupling is itself dimension- 
ful, this parameter must be the ratio of the coupling to the appropriate 
power of either the particle mass m (if it isn’t zero) or, in the high-energy 
regime s 3> m 2 , the Mandelstam variable s. Thus the relevant parame- 
ter is g s~ If [g] is negative [and it usually is: see eq. (12.11)], then 
g g-ig]/ 2 blows up at high energies, and the perturbative expansion breaks 
down. This behavior is connected to the nonrenormalizability of theories 
with couplings with negative mass dimension, a subject we will take Tip in 
section 18. It turns out that such theories require an infinite number of 
input parameters to make sense; see section 29. In the opposite case, [g] 
positive, the theory becomes trivial at high energy, because g s - ^/ 2 goes 
rapidly to zero. 

Thus the case of [g\ = 0 is just right: scattering amplitudes can have a 
nontrivial dependence on g at all energies. 

Therefore, from here on, we will be primarily interested in p > 3 theory in 
d = 6 spacetime dimensions, where [53] = 0. 

Problems 

12.1) Express he in GeV frn, where 1 fm = 1 Fermi = IIP 13 cm. 

12.2) Express the masses of the proton, neutron, pion, electron, muon, and 
tau in GeV. 

12.3) The proton is a strongly interacting blob of quarks and gluons. It 
has a nonzero charge radius r p , given by r 2 = / d 3 x p(r)r 2 , where p(r) 
is the quantum expectation value of the electric charge distribution 
inside the proton. Estimate the value of r p , and then look up its 
measured value. How accurate was your estimate? 
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13 The Lehmann-Kallen Form of the Exact 

Propagator 

Prerequisite: 9 


Before turning to the subject of loop corrections to scattering amplitudes, 
it will be helpful to consider what we can learn about the exact propagator 
A (x — y) from general principles. We define the exact propagator via 

A(x -y) = i(0|T<p(x)<p(y)|0) . (13.1) 

We take the field <p(x) to be normalized so that 

(0|(/j(x)|0) = 0 and (fc|</?(a;)|0) = e~ lkx . (13-2) 

In d spacetime dimensions, the one-particle state | k) has the normalization 
(k\k') = (2vr) d - 1 2u> <F -1 (k - k') , (13.3) 

with uj = (k 2 + m 2 ) 1 / 2 . The corresponding completeness statement is 


J dk \k)(k\ = h , 


(13.4) 


where I\ is the identity operator in the one-particle subspace, and 


dk = 


d d ~ l k 
{2n) d ~ l 2 uj 


(13.5) 


is the Lorentz invariant phase-space differential. We also define the exact 
momentum-space propagator A (k 2 ) via 


A 



d d k 
(2vr ) d 


e ik(x- y ) _ 


In free-held theory, the momentum-space propagator is 


A (k 2 ) 


1 

k 2 + m 2 — ie 


(13.6) 


(13.7) 


It has an isolated pole at k 2 = — m 2 with residue one; m is the actual, phys- 
ical mass of the particle, the mass that enters into the energy-momentum 
relation. 

We begin our analysis with eq. (13.1). We take x° > y°, and insert 
a complete set of energy eigenstates between the two fields. Recall from 
section 5 that there are three general classes of energy eigenstates: 
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1. The ground state or vacuum |0), which is a single state with zero 
energy and momentum. 

2. The one particle states | k), specified by a three-momentum k and 
with energy oj = (k 2 + m 2 ) 1 / 2 . 

3. States in the multiparticle continuum \k,n), specified by a three- 
momentum k and other parameters (such as relative momenta among 
the different particles) that we will collectively denote as n. The 
energy of one of these states is oj = (k 2 + M 2 ) 1 / 2 , where M > 2 m; M 
is one of the parameters in the set n. 

Thus we get 

(0|¥>(®Ms/)| 0) = (0|<p(x)|0)(0|<p(y)|0) 

+ J dk (0\<p(x)\k)(k\ip(y)\0) 

+ Y1 [ dk (°\ ( P( x )\ k ,n){k,n\<p(y)\0) • ( 13 - 8 ) 

n ** 

The sum over n is schematic, and includes integrals over continuous pa- 
rameters like relative momenta. 

The first two terms in eq. (13.8) can be simplified via eq. (13.2). Also, 
writing the held as <p{x) = exp(— iP fJj x fJj )(p(0)exp(+iP y x fl ), where P y is the 
energy-momentum operator, gives us 

(k,n\g>(x)\0) = e~ lkx (k,n\<p(0)\0) , (13.9) 

where k° = (k 2 + M 2 ) 1 / 2 . We now have 

(0|^(xV(y)|0> = f dke ik ( x ~ y) + J2 [ dke iHx - y) \(k,n\^(0)\0)\ 2 . (13.10) 

J n J 

Next, we define the spectral density 

P ( S ) = Y,\( k ’ n \<p(.m\ 2 s(s- M >) . (i3.ii) 

n 

Obviously, p(s) > 0 for s > 4m 2 , and p(s) = 0 for s < Am 2 . Now we have 

roo r 

dke iHx-y) + J ^dsp(s) J dke ik ^ x ~ y) . (13.12) 

In the hrst term, k° = (k 2 +m 2 ) 1//2 , and in the second term, k° = (k 2 +s) 1//2 . 
Clearly we can also swap x and y to get 

dke~ ik{x - y) + j o dsp{s) J dk e~ lkix - y) (13.13) 
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as well. We can then combine eqs. (13.12) and (13.13) into a formula for 
the time-ordered product 


(0|TV(x)<p(y)|0) = 0(x o -y°)(O|<p(x)<p(y)|O) + 0(y°-x°)(0\p(y)v(x)\0), 

(13.14) 


where 9(t) is the unit step function, by means of the identity 


d d k 


Jk(x-y) 


(2Tr) d k 2 + m 2 — ie 


i9{x°—y°) J dke ik{ - x ~ y) 

+ i9(y°-x ° ) J dke~ ik{ - x -^ ; (13.15) 


the derivation of eq. (13.15) was sketched in section 8. Combining eqs. (13.12- 
13.15), we get 


*(0|TV(x)<p(y)|0) = 


d d k 

(2vr) a 


Jk{x-y) 


1 


k 2 + m 2 — ie 

1 1 


+ / ds p(s ) — 

JAm 2 k z + s - ie 


Comparing eqs. (13.1), (13.6), and (13.16), we see that 


A (k 2 ) = 


1 


k 2 + m 2 — ie 74 m 2 


ds p(s ) 


k 2 + s — i 


ie 


(13.16) 


(13.17) 


This is the Lehmann-Kallen form of the exact momentum-space propagator 
A (k 2 ). We note in particular that A (k 2 ) has an isolated pole at k 2 = — m 2 
with residue one , just like the propagator in free- field theory. 


Problems 


13.1) Consider an interacting scalar field theory in d spacetime dimensions, 

C = -^Z^ipd^ip - 7}Z m m 2 (p 2 - Ci(p) > (13.18) 

where C\{p) is a function of p (and not its derivatives). The exact 
momentum-space propagator for p can be expressed in Lehmann- 
Kallen form by eq. (13.17). Find a formula for the renormalizing fac- 
tor Z v in terms of p{s). Hint: consider the commutator \p{x),p{y)\. 
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14 Loop Corrections to the Propagator 

Prerequisite: 10, 12, 13 


In section 10, we wrote the exact propagator as 


jA(xi-x 2 ) = (0|T<^(xi)^(x2)|0) = SiS2iW(J) , 


(14.1) 


where iW(J) is the sum of connected diagrams, and 8 t acts to remove a 
source from a diagram and label the corresponding propagator endpoint 
Xi . In (p 3 theory, the 0(g 2 ) corrections to AA(xi— x 2 ) come from the di- 
agrams of fig. (14.1). To compute them, it is simplest to work directly in 
momentum space, following the Feynman rules of section 10. An appro- 
priate assignment of momenta to the lines is shown in fig. (14.1); we then 
have 


i A (fc 2 ) = \A(k 2 ) + A A (k 2 ) \iU(k 2 )] ±A(k 2 ) + 0(g 4 ) 


where 

A(k 2 ) 

is the free-held propagator, and 


1 

k 2 + m 2 — ie 


/ \ 2 r 4 d P 

® (k 2 ) = \{ig? (A) J j^- d m+k) 2 )A(t 2 ) 
— i(Ak 2 + Bm 2 ) + 0(g 4 ) 


(14.2) 

(14.3) 


(14.4) 


is the self-energy. Here we have written the integral appropriate for d 
spacetime dimensions; for now we will leave d arbitrary, but later we will 
want to focus ou d = 6, where the coupling g is dimensionless. 

In the first term in eq. (14.4), the factor of one-half is the symmetry 
factor associated with exchanging the top and bottom semicircular prop- 
agators. Also, we have written the vertex factor as ig rather than iZ g g 
because we expect Z g = 1 + 0(g 2 ), and so the Z g — 1 contribution can 
be lumped into the 0(g 4 ) term. In the second term, A = Z^ — 1 and 
B = Z rn — 1 are both expected to be 0(g 2 ). 

It will prove convenient to define n(/c 2 ) to all orders via the geometric 
series 


\A{k 2 ) = \ A (k 2 ) + jA(k 2 ) ^n(/c 2 )j XA(k 2 ) 

+ AA (k 2 ) m(fe 2 ) AA(fe 2 ) \iU(k 2 )} AA (k 2 ) 


+ ... . 


(14.5) 
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Figure 14.1: The 0(g 2 ) corrections to the propagator. 



Figure 14.2: The geometric series for the exact propagator. 


This is illustrated in fig. (14.2). The sum in eq. (14.5) will include all the 
diagrams that contribute to A (k 2 ) if we take ill(£: 2 ) to be given by the sum 
of all diagrams that are one-particle irreducible , or 1PI for short. A diagram 
is 1PI if it is still connected after any one line is cut. The 1PI diagrams 
that make an 0(g 4 ) contribution to ill(k 2 ) are shown in fig. (14.3). When 
writing down the value of one of these diagrams, we omit the two external 
propagators. 

If we sum up the series in eq. (14.5), we get 


A (k 2 ) 


1 

k 2 + m 2 — ie — II(&: 2 ) 


(14.6) 


In section 13, we learned that the exact propagator has a pole at k 2 = —m 2 
with residue one. This is consistent with eq. (14.6) if and only if 


II(-m 2 ) = 0 , (14.7) 

n'(-m 2 ) = 0 , (14.8) 


where the prime denotes a derivative with respect to k 2 . We will use 
eqs. (14- 7) and (14-8) to fix the values of A and B. 





Figure 14.3: The 0(g 4 ) contributions to iII(A; 2 ). 
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Next we turn to the evaluation of the 0(g 2 ) contribution to ill(k 2 ) in 
eq. (14.4). We have the immediate problem that the integral on the right- 
hand side diverges at large £ for d > 4. We faced a similar situation in 
section 9 when we evaluated the lowest-order tadpole diagram. There we 
introduced an ultraviolet cutoff A that modified the behavior of A(£ 2 ) at 
large l 2 . Here, for now, we will simply restrict our attention to d < 4, 
where the integral in eq. (14.4) is finite. Later we will see what we can say 
about larger values of d. 

We will evaluate the integral in eq. (14.4) with a series of tricks. We 
first use Feynman’s formula to combine denominators, 

— — = I dF n {x\A\ + . . . + x n A n ) n , (14.9) 

A\... A n J 

where the integration measure over the Feynman parameters x t is 

J dF n = (n— 1)! J dx\ . . . dx n 5{x\ + . . . + x n — 1) . (14.10) 

This measure is normalized so that 

J dF n 1 = 1. (14.11) 


We will prove eq. (14.9) in problem 14.1. 

In the case at hand, we have 

A((k+£) 2 )A{£ 2 ) = — — L - 

(. 1 2 + m 2 )((£ + k) 2 + m 2 ) 

Ip -i —2 

dx x{{£ + k) 2 + m 2 ) + (1— x)(£ 2 + m 2 ) 

Ip _2 

dx £ 2 + 2x£-k + xk 2 + m 2 

1 1 _2 
dx (£ + xk) 2 + x(l— x)k 2 + m 2 

Ip 1 —2 

dx q 2 + D , (14.12) 

where we have suppressed the ie’s for notational convenience; they can be 
restored via the replacement m 2 — > m 2 —ie. In the last line we have defined 



q = £ + xk 


(14.13) 


and 


D = x{\— x)k 2 + m 2 . 


(14.14) 



14: Loop Corrections to the Propagator 


112 


Im q° 


X 





X 


Reg 0 


Figure 14.4: The q° integration contour along the real axis can be rotated 
to the imaginary axis without passing through the poles at q° = —u + ie 
and q° = +cj — ie. 


We then change the integration variable in eq. (14.4) from t to q: the jaco- 
bian is trivial, and we have d d t = d d q. 

Next, think of the integral over q° from — oo to +oo as a contour integral 
in the complex q° plane. If the integrand vanishes fast enough as \q°\ — > oo, 
we can rotate this contour clockwise by 90°, as shown in fig. (14.4), so that 
it runs from —zoo to -H'oo. In making this Wick rotation , the contour 
does not pass over any poles. (The ze’s are needed to make this statement 
unambiguous.) Thus the value of the integral is unchanged. It is now 
convenient to define a euclidean d-dimensional vector q via q° = iq d and 
qj = qj\ then q 2 = q 2 , where 

Q 2 =ql + ■■■ + qj ■ (14.15) 

Also, d d q = i d d q. Therefore, in general, 

J d d q f(q 2 -ie) = i j d d q f(q 2 ) (14.16) 

as long as f(q 2 ) — > 0 faster than l/q d as q — > oo. 

Now we can write 

U{k 2 ) = y 2 I{k 2 ) - Ak 2 - Bm 2 + 0(g 4 ) , (14.17) 

where 

7(/r ) = J Q dxj ^2 + d) 2 ■ ( 14 - 18 ) 

It is now straightforward to evaluate the d-dimensional integral over q in 
spherical coordinates. 

Before we perform this calculation, however, let us introduce another 
trick, one that can simplify the task of fixing A and B through the impo- 
sition of eqs. (14.7) and (14.8). Here is the trick: differentiate n(/c 2 ) twice 
with respect to k 2 to get 

n "(k 2 ) = y 2 i"(k 2 ) + o(g A ), 


(14.19) 
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where, from eqs. (14.18) and (14.14), 

I"(g) = £ dz 6* 2 U ~*) 2 / tfTW ' (14 ' 20) 

Then, after we evaluate these integrals, we can get Ti{k 2 ) by integrating with 
respect to k 2 , subject to the boundary conditions of eqs. (14.7) and (14.8). 
In this way we can construct IT(/c 2 ) without ever explicitly computing A 
and B. 

Notice that this trick does something else for us as well. The integral 
over q in eq. (14.20) is finite for any d < 8, whereas the original integral in 
eq. (14.18) is finite only for d < 4. This expanded range of d now includes 
the value of greatest interest, d = 6. 

How did this happen? We can gain some insight by making a Taylor 
expansion of n(fc 2 ) about k 2 = —m 2 : 


U(k 2 ) = 


\g 2 I(—m 2 ) + (A — B)m 2 


+ 


+ 


\g 2 l'(—m 2 ) + A {k 2 + m 2 ) 


\g 2 1" (—to 2 ) ( k 2 + m 2 ) 2 + 


+ 0(g 4 


(14.21) 


From eqs. (14.18) and (14.14), it is straightforward to see that /(— to 2 ) 
is divergent for d > 4, /'(—to 2 ) is divergent for d > 6, and, in general, 
/( n )(_ m 2 ) ig divergent for d > 4 + 2 n. We can use the 0(g 2 ) terms in 
A and B to cancel off the ^g 2 I(—m 2 ) and \g 2 V[—m 2 ) terms in n(/ 2 ), 
whether or not they are divergent. But if we are to end up with a finite 
n(/c 2 ), all of the remaining terms must be finite, since we have no more free 
parameters left to adjust. This is the case for d < 8. 

Of course, for 4 < d < 8, the values of A and B (and hence the la- 
grangian coefficients Z = 1 + A and Z m = 1 + B) are formally infinite, and 
this may be disturbing. However, these coefficients are not directly mea- 
surable, and so need not obey our preconceptions about their magnitudes. 
Also, it is important to remember that A and B each includes a factor of 
g 2 \ this means that we can expand in powers of A and B as part of our 
general expansion in powers of g. When we compute n(A; 2 ) (which enters 
into observable cross sections), all the formally infinite numbers cancel in 
a well-defined way, provided d < 8. 

For d > 8, this procedure breaks down, and we do not obtain a finite 
expression for n(/c 2 ). In this case, we say that the theory is nonrenormaliz- 
able. We will discuss the criteria for renormalizability of a theory in detail 
in section 18. It turns out that (p 3 theory is renormalizable for d < 6. (The 
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problem with 6 < d < 8 arises from higher-order corrections, as we will see 
in section 18.) 

Now let us return to the calculation of P(k 2 ). Rather than using the 
trick of first computing II "(k 2 ), we will instead evaluate II(/c 2 ) directly 
from eq. (14.18) as a function of d for d < 4. Then we will analytically 
continue the result to arbitrary d. This procedure is known as dimensional 
regularization. Then we will fix A and B by imposing eqs. (14.7) and (14.8), 
and finally take the limit d —> 6. 

We could just as well use the method of section 9. Making the replace- 
ment 


A(p 2 


1 


A 2 


p 2 + m 2 — ie p 2 + A 2 — ie 


(14.22) 


where A is the ultraviolet cutoff, renders the 0(g 2 ) term in II(A; 2 ) finite 
for d < 8; This procedure is known as Pauli-Villars regularization. We 
then evaluate II(A: 2 ) as a function of A, fix A and B by imposing eqs. (14.7) 
and (14.8), and take the A — ► oo limit. Calculations with Pauli-Villars 
regularization are generally much more cumbersome than they are with 
dimensional regularization. However, the final result for n(/c 2 ) is the same. 
Eq. (14.21) demonstrates that any regularization scheme will give the same 
result for d < 8, at least as long as it preserves the Lorentz invariance of 
the integrals. 

We therefore turn to the evaluation of /(A: 2 ), eq. (14.18). The angu- 
lar part of the integral over q yields the area Q d of the unit sphere in d 
dimensions, which is 


H d = 


2n d / 2 

rp ; 


(14.23) 


this is most easily verified by computing the gaussian integral / d d qe~ ^ in 
both cartesian and spherical coordinates. Here T(x) is the Euler gamma 
function; for a nonnegative integer n and small x, 


r(n+l) = n! , 


T(n+i) 

T(— n+x) 


(2 n)! 

n!2" 

(- 1 )’ 

n! 


7T , 

1 

X 


7 + 5ZL i k 1 +0{x) 


(14.24) 

(14.25) 

(14.26) 


where 7 = 0.5772 ... is the Euler-Mascheroni constant. 

The radial part of the q integral can also be evaluated in terms of gamma 
functions. The overall result (generalized slightly) is 


f d d g {g 2 ) a = T{b-a-y)T(a+kd) D _ (b _ a _ d/2) 

J (2ir) d {q 2 + D) b (4vr)' i / 2 r(6)r(i(i) 
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We will make frequent use of this formula throughout this book. In the 
case of interest, eq. (14.18), we have a = 0 and 6 = 2. 

There is one more complication to deal with. Recall that we want to 
focus on d = 6 because in that case g is dimensionless. However, for general 
d , g has mass dimension e/2, where 

e = 6 - d . (14.28) 

To account for this, we introduce a new parameter jl with dimensions of 
mass, and make the replacement 

9 - gjd £/2 • (14.29) 


In this way g remains dimensionless for all e. Of course, fl is not an actual 
parameter of the d = 6 theory. Therefore, nothing measurable (like a cross 
section) can depend on it. 

This seemingly innocuous statement is actually quite powerful, and will 
eventually serve as the foundation of the renormalization group. 

We now return to eq. (14.18), use eq. (14.26), and set d = 6 — e; we get 


I(k 2 ) = 


r(— 1+ 


r - 1 / 4 tt \ £ / 2 

dxD ( — J 


/ o 


(47 r) 3 

Hence, with the substitution of eq. (14.29), and defining 

,2 


a = 


9 


(47 r) 


for notational convenience, we have 


> n 


2 \£/2 


Il(k 2 ) = \aT(- 1+f) 

\ 

— Ak 2 — Bm 2 + 0(a 2 ) . 

Now we can take the e — > 0 limit, using eq. (14.26) and 

A e ' 2 = 1 + f In A + 0{e 2 ) . 


(14.30) 


(14.31) 


(14.32) 


(14.33) 


The result is 

H (k 2 ) = 


(! + l)(l fc 2 +m 2) + j\, xDln ( 47r k 

— Ak 2 — Bm 2 + 0(a 2 ) . 


~2 a 


e^D 


(14.34) 
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Here we have used Jq dx D = ^k 2 + m 2 . It is now convenient to define 

p = VLk e -7 / 2 p , (14.35) 

and rearrange things to get 

n(fc 2 ) = \a [ dxDln(D/m 2 ) 

Jo 

-{ga \ + \n(p/m) + \ + A^k 2 
— | a 4^ + In (p/m) + ^ + H j m 2 + 0(a 2 ) . (14.36) 

If we take A and B to have the form 

A = — \ + ln(p/m) + \ + k a +0(a 2 ), (14.37) 

B = — a + In (p/m) + | + kb + 0(a 2 ) , (14.38) 

where ka and kb are purely numerical constants, then we get 

U(k 2 ) = \a ( dx Dln(D/m 2 ) + a(j;K,Ak 2 + KBm 2 ^J + 0(a 2 ) . (14.39) 

•J 0 

Thus this choice of A and B renders n(A; 2 ) finite and independent of p, as 
required. 

To fix ka and kb, we must still impose the conditions n(— m 2 ) = 0 and 
n' (— ?n 2 ) = 0. The easiest way to do this is to first note that, schematically, 

n (k 2 ) = \a f dx D In D + linear in k 2 and m 2 + 0(a 2 ) . (14.40) 

“ Jo 

We can then impose n(— m 2 ) = 0 via 

n (k 2 ) = ba [ dxD\n(D/Do) + linear in (k 2 + m 2 ) + 0(a 2 ) . (14.41) 

Jo 

where 

D 0 = D = [1— x(l— x)]m 2 . (14.42) 

k z =—m z 

Now it is straightforward to differentiate eq. (14.41) with respect to k 2 , and 
find that n '(— m 2 ) vanishes for 

n(A: 2 ) = J dx D Iu(D/Dq) — j^a(k 2 + m 2 ) + 0(a 2 ) . (14.43) 

The integral over x can be done in closed form; the result is 
n (k 2 ) = c\k 2 + C 2 m 2 + 2 k 2 f(r) + 0(a 2 ) , 


(14.44) 
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Figure 14.5: The real and imaginary parts of Il(k 2 )/(k 2 + m 2 ) in units of 
a. 


where c\ = 3— 7T\/3, C 2 = 3— 2n\/3, and 


/(?’) = r 3 tanh 1 (l/r) , (14.45) 

r = (1 + 4m 2 /A: 2 ) 1/,2 . (14.46) 

There is a branch point at k 2 = —4m 2 , and II(&: 2 ) acquires an imaginary 
part for k 2 < — 4m 2 ', we will discuss this further in the next section. 

We can write the exact propagator as 


A(k 2 ) 


1 — U(k 2 )/(k 2 + m 2 ) J k 2 + m 2 — ie 


(14.47) 


In fig. (14.5), we plot the real and imaginary parts of II (k 2 )/(k 2 + m 2 ) in 
units of a. We see that its values are quite modest for the plotted range. 
For much larger values of |fc 2 |, we have 


U(k 2 ) , 

~ —r\ 

k 2 + m 2 12 


ln(k 2 /m 2 ) + ci + 0(a 2 ) 


(14.48) 


If we had kept track of the ie' s, k 2 would be k 2 — ie; when k 2 is negative, 
we have ln(/c 2 — ie) = In \k 2 \ — m. The imaginary part of IT (k 2 )/(k 2 + m 2 ) 
therefore approaches the asymptotic value of + 0(a 2 ) when k 2 is 

large and negative. The real part of Il(k 2 )/(k 2 + m 2 ), however, continues 
to increase logarithmically with \k 2 \ when \k 2 \ is large. We will begin to 
address the meaning of this in section 26. 
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Problems 


14.1) Derive a generalization of Feynman’s formula, 


1 

J\V-\ J^Xn 


i /■ 1 

Tit F(aj) (n— 1)! J " ' 


(14.49) 


Hint: start with 

^ = J o °° dtt a - l e~ M , (14.50) 

which defines the gamma function. Put an index on A. a, and t, and 
take the product. Then multiply on the right-hand side by 

roo 

1 = ds 5(s - ■ (14.51) 

Jo 


Make the change of variable ti = sxi, and carry out the integral over 
s. 


14.2) Verify eq. (14.23). 

14.3) a) Show that 

J d d q q p f(q 2 ) = 0 , (14.52) 

/ d d q q^f(q 2 ) = C 2 g^ f d d q q 2 f(q 2 ) , (14.53) 

and evaluate the constant C 2 in terms of d. Hint: use Lorentz symme- 
try to argue for the general structure, and evaluate C 2 by contracting 
with g^. 

b) Similarly evaluate / d d q q p q u q p q a f (q 2 ) . 

14.4) Compute the values of ka and kb- 

14.5) Compute the 0( A) correction to the propagator in p 4 theory (see 
problem 9.2) in d = 4 — e spacetime dimensions, and compute the 
O(A) terms in A and B. 

14.6) Repeat problem 14.5 for the theory of problem 9.3. 

14.7) Renormalization of the anharmonic oscillator. Consider an anhar- 
rnonic oscillator, specified by the lagrangian 

L = \Zq 2 - \Z w u 2 q 2 - Z x Xia 3 q 4 . (14.54) 

We set ft = 1 and rn = 1; A is then dimensionless. 
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a) Find the hamiltonian H corresponding to L. Write it as H = 
Hq + H\, where Hq = \P 2 + ^oj 2 Q 2 , and [Q, P ] = i. 

b) Let |0) and |1) be the ground and first excited states of Hq, and 

let |fl) and |I) be the ground and first excited states of H. (We 
take all these eigenstates to have unit norm.) We define lo to be the 
excitation energy of H, to = E\ — Eq,. We normalize the position 
operator Q by setting (I|<5|fl) = (1|Q|0) = (2 cu) -1 / 2 . Finally, to 
make things mathematically simpler, we set Z\ equal to one, rather 
than using a more physically motivated definition. Write Z = 1 + A 
and Z w = 1 + B, where A = + 0( A 2 ) and B = KgA + 0(A 2 ). 

Use Rayleigh-Schrodinger perturbation theory to compute the O(A) 
corrections to the unperturbed energy eigenvalues and eigenstates. 

c) Find the numerical values of ka and kb that yield u = E\ — Eq 
and (I|Q|fi) = (2w) -1 / 2 . 

d) Now think of the lagrangian of eq. (14.54) as specifying a quantum 
field theory in d = 1 dimensions. Compute the 0(A) correction to 
the propagator. Fix k a and kb by requiring the propagator to have a 
pole at k 2 = —ui 2 with residue one. Do your results agree with those 
of part (c)? Should they? 
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15 The One-Loop Correction in 
Lehmann-Kallen Form 

Prerequisite: 14 


In section 13, we found that the exact propagator could be written in 
Lehmann-Kallen form as 


A (fc 2 ) 

1 r°° 1 

= , 2 , 2 • + / dsp(s) 

k z + m- — ie Jim 2 k z + s — ie 

(15.1) 

where the spectral density p{s) is real and nonnegative. In section 14, on 
the other hand, we found that the exact propagator could be written as 


A (h 2 ) — 1 

k 2 + m 2 — ie — n(/c 2 ) 

(15.2) 

and that, to 0(g 2 ) 

in tp 3 theory in six dimensions, 


n (k 2 ) = \a j 

[ dx D \h(D/Dq) — j^a(k 2 + m 2 ) + 0(a 2 ) , 

(15.3) 

where 

a = g 2 / (47t) 3 , 

(15.4) 


D = x(l— x)k 2 + m 2 — ie , 

(15.5) 


Dq = [1— x(l— x)\m 2 . 

(15.6) 


In this section, we will attempt to reconcile eqs. (15.2) and (15.3) with 
eq. (15.1). 

Let us begin by considering the imaginary part of the propagator. We 
will always take k 2 and m 2 to be real, and explicitly include the appropriate 
factors of ie whenever they are needed. 

We can use eq. (15.1) and the identity 

1 x ie 

x — ie x 2 + e 2 x 2 + e 2 

= P — inS(x) , (15-7) 

x 

where P means the principal part, to write 

_ /*oo 

Im A(/c 2 ) = Tr5(k 2 + m 2 ) + / ds p(s) n5(k 2 + s) 

J 4ra 2 

= ir5(k 2 + m 2 ) + 7r p{—k 2 ) , 


(15.8) 
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where p(s) = 0 for s < 4m 2 . Thus we have 

irp(s) = Im A(— s) for s > 4 m 2 . (15.9) 

Let us now suppose that ImII(£: 2 ) = 0 for some range of k 2 . (In section 
14, we saw that the 0(a) contribution to II(&: 2 ) is purely real for k 2 > 
—4m 2 .) Then, from eqs. (15.2) and (15.7), we get 

Im A(fe 2 ) = irS(k 2 + m 2 — II(fc 2 )) for ImII(/c 2 ) = 0 . (15.10) 

From II(— m 2 ) = 0, we know that the argument of the delta function van- 
ishes at k 2 = —m 2 , and from IT(— m 2 ) = 0, we know that the derivative of 
this argument with respect to k 2 equals one at k 2 = —in 2 . Therefore 

Im A.(k 2 ) = ir 5 (k 2 + m 2 ) for ImII(fc 2 ) = 0 . (15.11) 


Comparing this with eq. (15.8), we see that p(—k 2 ) = 0 if ImII(fc 2 ) = 0. 

Now suppose ImII(fc 2 ) is not zero for some range of k 2 . (In section 14, 
we saw that the O(a) contribution to II(/c 2 ) has a nonzero imaginary part 
for k 2 < —4m 2 .) Then we can ignore the ie in eq. (15.2), and 


IrnAfl 2 ) 


Im II(/c 2 ) 

(k 2 + m 2 + ReII(£: 2 )) 2 + (ImII(A; 2 )) 2 


for 


Comparing this with eq. (15.8) we see that 


lmU(k 2 ) + 0 . 

(15.12) 


npU) = ImII (- g ) . 

(— s + m 2 + ReII(— s)) 2 + (ImII(— s)) 2 


(15.13) 


Since we know p(s) = 0 for s < 4m 2 , this tells us that we must also have 
ImII(— s) = 0 for s < 4m 2 , or equivalently ImII(A: 2 ) = 0 for k 2 > —4m 2 . 
This is just what we found for the O(a) contribution to II (A; 2 ) in section 
14. 

We can also see this directly from eq. (15.3), without doing the integral 
over x. The integrand in this formula is real as long as the argument of the 
logarithm is real and positive. From eq. (15.5), we see that D is real and 
positive if and only if x(l—x)k 2 > — m 2 . The maximum value of x{l—x) 
is 1/4, and so the argument of the logarithm is real and positive for the 
whole integration range 0 < x < 1 if and only if k 2 > —4m 2 . In this 
regime, Im \l{k 2 ) = 0. On the other hand, for k 2 < —4m 2 , the argument 
of the logarithm becomes negative for some of the integration range, and 
so ImII(/c 2 ) ^ 0 for k 2 < —4m 2 . This is exactly what we need to reconcile 
eqs. (15.2) and (15.3) with eq. (15.1). 


Problems 
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15.1) In this problem we will verify the result of problem 13.1 to 0(a). 

a) Let ni oop (A; 2 ) be given by the first line of eq. (14.32), with e > 0. 
Show that, up to 0(a 2 ) corrections, 

A = n ioo P (-™ 2 ) • (15.14) 


Then use Cauchy’s integral formula to write this as 


A = 


dw IIioopM 
2ni (w + m 2 ) 2 


(15.15) 


where the contour of integration is a small counterclockwise circle 
around — m? in the complex w plane. 

b) By examining eq. (14.32), show that the only singularity of IIi 00 p(&; 2 ) 
is a branch point at k 2 = —4m 2 . Take the cut to run along the neg- 
ative real axis. 

c) Distort the contour in eq. (15.15) to a circle at infinity with a detour 
around the branch cut. Examine eq. (14.32) to show that, for e > 0, 
the circle at infinity does not contribute. The contour around the 
branch cut then yields 


A 


r-4m 2 


dw 1 
2ni ( w + m 2 ) 2 


Hoop^-Me) - IIi 00 p(u>-ie) , 


(15.16) 


where e is infinitesimal (and is not to be confused with £ = 6 —d). 

d) Examine eq. (14.32) to show that the real part of IIi 00 p('tc) is con- 
tinuous across the branch cut, and that the imaginary part changes 
sign, so that 


nioo P (w+?'e) - ni oop (u;— ie) = -2iImIIi oop (u;-ie) . (15.17) 


e) Let w = — s in eq. (15.16) and use eq. (15.17) to get 
A = -1 r ds . 

7T Tim 2 ( S - m Z Y 

Use this to verify the result of problem 13.1 to 0(a). 


(15.18) 


15.2) Dispersion relations. Consider the exact II(£: 2 ), with e = 0. Assume 
that its only singularity is a branch point at k 2 = —4m 2 , that it obeys 
eq. (15.17), and that II (k 2 ) grows more slowly than |A; 2 | 2 at large \k 2 \. 
By recapitulating the analysis in the previous problem, show that 



ImII(— s— ie) 
( k 2 + s) 3 


(15.19) 
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This is a twice subtracted dispersion relation. It gives n" (A; 2 ) through- 
out the complex k 2 plane in terms of the values of the imaginary part 
of II(£: 2 ) along the branch cut. 
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16 Loop Corrections to the Vertex 

Prerequisite: 14 


Consider the 0(g 3 ) diagram of fig. (16.1), which corrects the ip 3 vertex. In 
this section we will evaluate this diagram. 

We can define an exact three-point vertex function iV 3 {k \ , k 2 , k 3 ) as the 
sum of one-particle irreducible diagrams with three external lines carrying 
momenta ki . k 2j and k 3 , all incoming, with k\ + k 2 + k 3 = 0 by momentum 
conservation. (In adopting this convention, we allow h g to have either sign; 
if ki is the momentum of an external particle, then the sign of kf is positive 
if the particle is incoming, and negative if it is outgoing.) The original 
vertex iZ g g is the first term in this sum, and the diagram of fig. (16.1) is 
the second. Thus we have 

iV 3 (h,k 2 ,k 3 ) = iZ g g + (ig) 3 (\f J -0^ A((£-k 1 ) 2 )A((£+k 2 ) 2 )A(£ 2 ) 

+ 0(g 5 ). (16.1) 

In the second term, we have set Z g = 1 + 0(g 2 ). We proceed immediately 
to the evaluation of this integral, using the series of tricks from section 14. 

First we use Feynman’s formula to write 


A{{£-k,) 2 )A{{£+k 2 ) 2 )A{£ 2 ) 

= I dF 3 x\{l-k\) 2 + x 2 (£+k 2 ) 2 + x 3 f + m 2 


(16.2) 


where 


J dF 3 = 2 J dx\dx 2 dx 3 5{x\+x 2 +x 3 — 1) . (16.3) 

We manipulate the right-hand side of eq. (16.2) to get 
A((£-fe 1 ) 2 )A((£+fc 2 ) 2 )A(£ 2 ) 

= j dF 3 t 2 — 2l-(x\k\ — x 2 k 2 ) + x\ k\ + x 2 k^ + m 2 

= J dF 3 (£ - x 3 ki + x 2 k 2 ) 2 + xi(l-xi)/c 2 + x 2 (l-x 2 )k 2 

,i - 3 

+ 2x 3 x 2 k± ■ k 2 + m 

= j dF 3 [q 2 + D]~ 3 . (16.4) 

In the last line, we have defined q = £ — x\k\ + x 2 k 2l and 


D = xi{l—x\ )k\ + x 2 (l— x 2 )k 2 + 2x\x 2 ki-k 2 + m 2 
= x 3 x\k\ + x 3 x 2 k^ + x\x 2 k 3 + m 2 , 


(16.5) 
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Figure 16.1: The 0(g 3 ) correction to the vertex iV 3 (ki,k 2 ,k 3 ). 


where we used k 3 = (k\ + k 2 ) 2 and x\ + X 2 + x 3 = 1 to simplify the second 
line. 

After making a Wick rotation of the q° contour, we have 


V 3 (fci, k 2 , k 3 )/g = Zg + g 2 j dF 3 j ^ ^ + 0(g 4 ) , (16.6) 

where q is a euclidean vector. This integral diverges for d > 6. We therefore 
evaluate it for d < 6, using the general formula from section 14; the result 


f dd(1 1 = ^ 2 d ) D -(3-d/2) ( 16 7 \ 

J {2n) d (q> + Df 2(4vr) rf / 2 ' K ’ 

Now we set d = 6 — e. To keep g dimensionless, we make the replacement 
g — > gp £ ^ 2 . Then we have 


( ~2\ b / z 

-jj-j + 0(a 2 ) , (16.8) 

where a = g 2 /{ 47r) 3 . Now we can take the e — > 0 limit. The result is 

V 3 (ki,k 2 ,k 3 )/g = Zg + ka “ + J dF ?> ln j + 0(a 2 ) , (16.9) 


where we have used / dF 3 = 1. We now let p 2 = 47re 7 /l 2 , set 


= 1 + C 


(16.10) 


and rearrange to get 

V 3 (fei, k 2 , k 3 )/g = 1 + | a | + In (p/m) + c| 

— \a J dF 3 In (D/m 2 ) 

+ 0(a 2 ) . 


(16.11) 
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If we take C to have the form 

C = —a i + In (p/m) + kc + 0{a 2 ) , (16.12) 

where kc is a purely numerical constant, we get 

V 3 (Aq, Aft, A ; 3 )/<7 = 1 — J dF 3 ln(D/m 2 ) — Kca + 0(a 2 ) . (16.13) 

Thus this choice of C renders V 3 (Aq, Aft , Aft) finite and independent of p, as 
required. 

We now need a condition, analogous to II (— m 2 ) = 0 and II '(— m 2 ) = 0, 
to fix the value of kc- These conditions on Tl{k 2 ) were mandated by known 
properties of the exact propagator, but there is nothing directly comparable 
for the vertex. Different choices of kc correspond to different definitions of 
the coupling g. This is because, in order to measure g , we would measure 
a cross section that depends on g\ these cross sections also depend on kc- 
Thus we can use any value for kc that we might fancy, as long as we all 
agree on that value when we compare our calculations with experimental 
measurements. It is then most convenient to simply set kc = 0. This 
corresponds to the condition 

V 3 (0, 0, 0) = g - (16.14) 

This condition can then also be used to fix the higher-order (in g) terms in 
Z g . 

The integrals over the Feynman parameters in eq. (16.13) cannot be 
done in closed form, but it is easy to see that if (for example) \k 2 \ i$> m 2 , 
then 

V 3 (Aq, Aft, k 3 )/g ~ 1 - \a ln(kl/m 2 ) + 0(1) + 0(a 2 ) . (16.15) 

Thus the magnitude of the one-loop correction to the vertex function in- 
creases logarithmically with \k 2 \ when \k 2 \ 3> m 2 . This is the same behavior 
that we found for U(k 2 )/(k 2 + m 2 ) in section 14. 

Problems 

16.1) Compute the 0(A 2 ) correction to V 3 in <p 4 theory (see problem 9.2) in 
d = 4 — £ spacetime dimensions. Take V 4 = A when all four external 
momenta are on shell, and s = 4m 2 . What is the O(A) contribution 
to Cl 

16.2) Repeat problem 16.1 for the theory of problem 9.3. 
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17 Other 1PI Vertices 

Prerequisite: 16 

In section 16, we defined the three-point vertex function iV;s(k\ . k 2 , hi) as 
the sum of all one-particle irreducible diagrams with three external lines, 
with the external propagators removed. We can extend this definition to 
the ?r-point vertex i\ n {k \, . . . , k n ). 

There are two key differences between V n >3 and V3 in y? 3 theory. The 
first is that there is no tree- level contribution to V n >3. The second is that 
the one-loop contribution to V n >3 is finite for d < 2 n. In particular, the 
one- loop contribution to V n> 3 is finite for d = 6. 

Let us see how this works for the case n = 4. We treat all the external 
momenta as incoming, so that k\ + k 2 + £3 + k 4 = 0. One of the three 
contributing one- loop diagrams is shown in fig. (17.1); in this diagram, the 
k^ vertex is opposite to the k± vertex. Two other inequivalent diagrams are 
then obtained by swapping k% <-» £: 2 and £13 <-+ k 4 . We then have 

r d e f 

*V 4 = 9 J ^A((f-fc 1 ) 2 )A((Rfc 2 ) 2 )A((Rfc 2 +fc 3 ) 2 )A(f 2 ) 

+ (£3 k 2 ) + (&3 k 4 ) 

+ 0(g 6 ). (17.1) 

Feynman’s formula gives 

A(P-£ 1 ) 2 )A(P+fc 2 ) 2 )A(P+fc 2 +fc 3 ) 2 )A(P) 

= j dF 4 xi(£-ki) 2 + x 2 p+fc 2 ) 2 + x 3 (£+k 2 +k 3 ) 2 + x 4 £ 2 + m 2 

= / dF 4 q 2 + Hi 2 34 , (17.2) 

where q = t — x\k\ + x 2 k 2 + x^{k 2 +k^,) and, after making repeated use of 
®i+®2+®3+®4 = 1 and /ci+£; 2 -|-£:3-|-£:4 = 0, 

D\ 2 3 4 = Xix 4 k\ + x 2 x 4 k 2 + x 2 x 3 kl + xix^kj 

+ xix 2 {ki+k 2 ) 2 + x 3 X4(£: 2 -|-£:3) 2 + m 2 . (17.3) 

We see that the integral over q is finite for d < 8, and in particular for 
d = 6. After a Wick rotation of the q° contour and applying the general 
formula of section 14, we find 

f d 6 q 1 i 

J (27 r) 6 ( q 2 + D ) 4 6(47r) 3 D 


(17.4) 
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Figure 17.1: One of the three one- loop Feynman diagrams contributing 
to the four-point vertex iV^ki, & 2 , & 3 , A^); the other two are obtained by 
swapping k 3 and kz fc 4 . 

Thus we get 

Vi = I dFi + D^ + Ab) +0</) ' <17 ' 5) 

This expression is finite and well-defined; the same is true for the one-loop 
contribution to V ra for all n > 3. 

Problems 


17.1) Verify eq. (17.3). 
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18 Higher-Order Corrections and 
Renormalizability 

Prerequisite: 17 

In sections 14-17, we computed the one-loop diagrams with two, three, and 
four external lines for p 3 theory in six dimensions. We found that the first 
two involved divergent momentum integrals, but that these divergences 
could be absorbed into the coefficients of terms in the lagrangian. If this is 
true for all higher-order (in g) contributions to the propagator and to the 
one-particle irreducible vertex functions (with n > 3 external lines), then 
we say that the theory is renormalizable. If this is not the case, and further 
divergences arise, it may be possible to absorb them by adding some new 
terms to the lagrangian. If a finite number of such new terms is required, 
the theory is still said to be renormalizable. However, if an infinite number 
of new terms is required, then the theory is said to be nonrenormalizable. 
Despite the infinite number of parameters needed to specify it, a nonrenor- 
malizable theory is generally able to make useful predictions at energies 
below some ultraviolet cutoff A; we will discuss this in section 29. 

In this section, we will deduce the necessary conditions for renormaliz- 
ability. As an example, we will analyze a scalar held theory in d spacetime 
dimensions of the form 

OO 

C = -^Z^ipd^ip - \Z m m 2 p 2 - Y ^> z n9nV n • (18.1) 

n = 3 

Consider a Feynman diagram with E external lines, I internal lines, L 
closed loops, and V n vertices that connect n lines. (Here V n is just a num- 
ber, not to be confused with the vertex function V n . ) Do the momentum 
integrals associated with this diagram diverge? 

We begin by noting that each closed loop gives a factor of d d £i, and each 
internal propagator gives a factor of l/(p 2 + to 2 ), where p is some linear 
combination of external momenta ki and loop momenta The diagram 
would then appear to have an ultraviolet divergence at large £i if there are 
more f”s in the numerator than there are in the denominator. The number 
of €s in the numerator minus the number of f”s in the denominator is the 
diagram’s superficial degree of divergence 

D = dL-2I , (18.2) 


and the diagram appears to be divergent if 


D > 0 . 


(18.3) 
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Next we derive a more useful formula for D. The diagram has E external 
lines, so another contributing diagram is the tree diagram where all the lines 
are joined by a single vertex, with vertex factor — iZ E g E ; this is, in fact, 
the value of this entire diagram, which then has mass dimension [g E \ ■ (The 
Z's are all dimensionless, by definition.) Therefore, the original diagram 
also has mass dimension [g E \ , since both are contributions to the same 
scattering amplitude: 

[diagram] = \g E ] . (18.4) 

On the other hand, the mass dimension of any diagram is given by the sum 
of the mass dimensions of its components, namely 

OO 

[diagram] = dL — 21 + ^ V n [g n ] . (18.5) 

n = 3 


From eqs. (18.2), (18.4), and (18.5), we get 


D = \g E \ 




n=3 


(18.6) 


This is the formula we need. 

From eq. (18.6), it is immediately clear that if any [g n ] < 0, we expect 
uncontrollable divergences, since D increases with every added vertex of 
this type. Therefore, a theory with any [g n ] < 0 is nonrenormalizable. 

According to our results in section 12, the coupling constants have mass 


dimension 

[< 9n ] = d - \n(d - 2) , 

(18.7) 

and so we have 

2d 

[g n ] <0 if n> d _ 2 - 

(18.8) 


Thus we are limited to powers no higher than <£> 4 in four dimensions, and 
no higher than <^ 3 in six dimensions. 

The same criterion applies to more complicated theories as well: a the- 
ory is nonrenormalizable if any coefficient of any term in the lagrangian 
has negative mass dimension. 

What about theories with couplings with only positive or zero mass 
dimension? We see from eq. (18.6) that the only dangerous diagrams (those 
with D > 0) are those for which [g E \ > 0. But in this case, we can absorb 
the divergence simply by adjusting the value of Z E . This discussion also 
applies to the propagator; we can think of II(&: 2 ) as representing the loop- 
corrected counterterm vertex Ak 2 + Bm 2 , with A and Bm 2 playing the 
roles of two couplings. We have [A] = 0 and [Bm 2 ] = 2, so the contributing 
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Figure 18.1: The one-loop contribution to V4. 




Figure 18.2: A two-loop contribution to V4, and the corresponding coun- 
terterm insertion. 

diagrams are expected to be divergent (as we have already seen in detail), 
and the divergences must be absorbed into A and Bm 2 . 

D is called the superficial degree of divergence because a diagram might 
diverge even if D < 0, or might be finite even if D > 0. The latter can 
happen if there are cancellations among £’s in the numerator. Quantum 
electrodynamics provides an example of this phenomenon that we will en- 
counter in Part III; see problem 62.3. For now we turn our attention to the 
case of diagrams with D < 0 that nevertheless diverge. 

Consider, for example, the diagrams of figs. (18.1) and (18.2). The one- 
loop diagram of fig. (18.1) with E = 4 is finite, but the two-loop correction 
from the first diagram of fig. (18.2) is not: the bubble on the upper prop- 
agator diverges. This is an example of a divergent subdiagram. However, 
this is not a problem in this case, because this divergence is canceled by 
the second diagram of fig. (18.2), which has a counterterm vertex in place 
of the bubble. 

This is the generic situation: divergent subdiagrams are diagrams that, 
considered in isolation, have D > 0. These are precisely the diagrams whose 
divergences can be canceled by adjusting the Z factor of the corresponding 
tree diagram (in theories where [g n ] > 0 for all nonzero g n ). 

Thus, we expect that theories with couplings whose mass dimensions 
are all positive or zero will be renormalizable. A detailed study of the 
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properties of the momentum integrals in Feynman diagrams is necessary 
to give a complete proof of this. It turns out to be true without further 
restrictions for theories that have spin-zero and spin-one-half fields only. 

Theories with spin-one fields are renormalizable for d = 4 if and only if 
these spin-one fields are associated with a gauge symmetry. We will study 
this in Part III. 

Theories of fields with spin greater than one are never renormalizable 
for d > 4. 


Reference Notes 

Explicit two-loop calculations in p 3 theory can be found in Collins , Muta, 
and Sterman. 


Problems 

18.1) In any number d of spacetime dimensions, a Dirac field ^(a;) car- 
ries a spin index a, and has a kinetic term of the form 
where we have suppressed the spin indices; the gamma matrices 7 ^ 
are dimensionless, and T = 'I'^y 0 . 

a) What is the mass dimension [T] of the field T? 

b) Consider interactions of the form g n {^^f) n , where n > 2 is an 
integer. What is the mass dimension [ g n ] of g n l 

c) Consider interactions of the form g m .nP m k) n , where p is a scalar 
field, and m > 1 and n > 1 are integers. What is the mass dimension 
[SVn,n] of gm,nf 

d) In d = 4 spacetime dimensions, which of these interactions are 
allowed in a renormalizable theory? 
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19 Perturbation Theory to All Orders 

Prerequisite: 18 

In section 18, we found that, generally, a theory is renormalizable if all 
of its lagrangian coefficients have positive or zero mass dimension. In this 
section, using ip 3 theory in six dimensions as our example, we will see how 
to construct a finite expression for a scattering amplitude to arbitrarily 
high order in the p 3 coupling g. 

We begin by summing all one-particle irreducible diagrams with two 
external lines; this gives us the self-energy n(/c 2 ). We next sum all 1PI 
diagrams with three external lines; this gives us the three-point vertex 
function V^Aq , Aq, hi). Order by order in g , we must adjust the value 
of the lagrangian coefficients Z^, Z m , and Z g to maintain the conditions 
n(— m 2 ) = 0, n'(-m 2 ) = 0, and V 3 ( 0 , 0,0) = g. 

Next we will construct the n-point vertex functions V n (Aq , . . . , k n ) with 
4 < n < E, where E is the number of external lines in the process of 
interest. We compute these using a skeleton expansion. This means that 
we draw all the contributing 1PI diagrams, but omit diagrams that include 
either propagator or three-point vertex corrections. That is, we include 
only diagrams that are not only 1PI, but also 2PI and 3PI: they remain 
connected when any one, two, or three lines are cut. (Cutting three lines 
may isolate a single tree- level vertex, but nothing more complicated.) Then 
we take the propagators and vertices in these diagrams to be given by the 
exact propagator A (k 2 ) = ( k 2 + m 2 — n(/c 2 )) -1 and vertex 'V:i(k\ . k- 2 , fcs), 
rather than by the tree-level propagator A(k 2 ) = ( k 2 + m 2 )^ 1 and vertex 
g. We then sum these skeleton diagrams to get V n for 4 < n < E. Order 
by order in g, this procedure is equivalent to computing V n by summing 
the usual set of contributing 1PI diagrams. 

Next we draw all tree-level diagrams that contribute to the process 
of interest (which has E external lines), including not only three-point 
vertices, but also n-point vertices for n = 3, 4, . . . , E. Then we evaluate 
these diagrams using the exact propagator A (A; 2 ) for internal lines, and 
the exact 1PI vertices V n ; external lines are assigned a factor of one. 1 We 
sum these tree diagrams to get the scattering amplitude; loop corrections 
have all been accounted for already in A (A: 2 ) and V n . Order by order in 
g , this procedure is equivalent to computing the scattering amplitude by 
summing the usual set of contributing diagrams. 

Thus we now know how to compute an arbitrary scattering amplitude 

1 This is because, in the LSZ formula, each Klein-Gordon wave operator becomes (in 
momentum space) a factor of kj + m 2 that multiplies each external propagator, leaving 
behind only the residue of the pole in that propagator at k 2 = — m 2 ; by construction, 
this residue is one. 
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to arbitrarily high order. The procedure is the same in any quantum field 
theory; only the form of the propagators and vertices change, depending 
on the spins of the fields. 

The tree-level diagrams of the final step can be thought of as the Feyn- 
man diagrams of a quantum action (or effective action , or quantum effective 
action) T(y?). There is a simple and interesting relationship between the ef- 
fective action T(y>) and the sum of connected diagrams with sources iW ( J). 
We derive it in section 21. 


Reference Notes 

The detailed procedure for renormalization at higher orders is discussed in 
Coleman , Collins, Muta, and Sterman. 
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20 Two-Particle Elastic Scattering at One 

Loop 

Prerequisite: 19 


We now illustrate the general rules of section 19 by computing the two- 
particle elastic scattering amplitude, including all one-loop corrections, in 
ip 3 theory in six dimensions. Elastic means that the number of outgoing 
particles (of each species, in more general contexts) is the same as the 
number of incoming particles (of each species). 

We computed the amplitude for this process at tree level in section 10, 
with the result 


iT tree = -Aig) 2 A(-s) + A (-t) + A (-u) 


( 20 . 1 ) 


where A(— s) = l /(— s + m 2 — ie) is the free-held propagator, and s, f, and 
u are the Mandelstam variables. Later we will need to remember that s is 
positive, that t and u are negative, and that s + t + u = 4m 2 . 

The exact scattering amplitude is given by the diagrams of fig. (20.1), 
with all propagators and vertices interpreted as exact propagators and ver- 
tices. (Recall, however, that each external propagator contributes only the 
residue of the pole at k 2 = — m 2 , and that this residue is one; thus the fac- 
tor associated with each external line is simply one.) We get the one- loop 
approximation to the exact amplitude by using the one-loop expressions 
for the internal propagators and vertices. We thus have 

VT \- loop = i([iV 3 (s)] 2 A(— a) + [*V 3 (f)] 2 A(— t) + [iV 3 (u)] 2 A(-u)) 

+ *V 4 (s,t,it) , (20.2) 


where, suppressing the ie' s, 

1 


A(-s) = 


— s + m 2 — II(— s) 

II(— s) = \ a j dxD 2 {s)\n( K D 2 (s)/Do s j — j^a(—s + m 2 ) , 

V 3 (s)/g = 1 - \a j dF 3 In (T 3 (s)/m 2 ) , 

V 4 (s,t,it) = \g 2 a J dF 4 


1 1 

+ 


DAs,t) DAt,u ) D^u, s)J 


Here a = <7 2 /(47r) 3 , the Feynman integration measure is 

J dF n f{x) = (n-l)\ J dx l ...dx n S(xi+...+x n -l)f(x) 


(20.3) 

(20.4) 

(20.5) 

(20.6) 
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r 1 rl-xi ri-x i-...-x n -2 

= (n— 1)! / dx i / dx 2 ■ ■ ■ / dx n _ 4 

Jo Jo Jo 


l—Xl—...—X n —2 


to Jo 

xf(x 


X n = l — Xl — ...—X n -l 


and we have defined 


(20.7) 


D 2 (s) = —x{l —x)s + m 2 , (20.8) 

Do = +[1— x(l— x)]m 2 , (20.9) 

D 3 (s) = -xix 2 s + [l-(x 4 +x 2 )x 3 ]m 2 , (20.10) 

D 4 (s,f) = — x 4 x 2 s — x 3 x 4 f + [1— (xi+x 2 )(x 3 +X4)]m 2 . (20.11) 


We obtain V 3 (s) from the general three-point function V 3 (fc 4 , /c 2 , fc 3 ) by set- 
ting two of the three k 2 to — m 2 , and the third to — s. We obtain V 4 (s, t, u) 
from the general four-point function V 4 (fc 4 , . . . , fc 4 ) by setting all four k'f 
to — m 2 , {k\ + /c 2 ) 2 to — s, (k\ + k 3 ) 2 to —t, and (k\ + /c 4 ) 2 to —u. (Recall 
that the vertex functions are defined with all momenta treated as incoming; 
here we have identified — fc 3 and — fc 4 as the outgoing momenta.) 

Eqs. (20.2-20.11) are formidable expressions. To gain some intuition 
about them, let us consider the limit of high-energy, fixed angle scattering, 
where we take s, |t|, and |u| all much larger than m 2 . Equivalently, we are 
considering the amplitude in the limit of zero particle mass. 

We can then set m 2 = 0 in D 2 (s), D 3 (s), and D 4 (s,f). For the self- 
energy, we get 


n (-s) = - 


\a s 


dxx( 1— x) 


in Hi 


+ In 


x(l — x) 

1— x(l— x) )\ 


+ -^a s 


12 


a s 


In (—s/m 2 ) + 3 — 7tv^3 


(20.12) 


Thus, 


A(-s) 


1 

-s-n(-s) 


- — + ln(— s/m 2 ) + 3 — ttVs ^ + 0{a 2 ) . (20.13) 


The appropriate branch of the logarithm is found by replacing s by s + ie. 
For s real and positive, — s lies just below the negative real axis, and so 


ln(— s) = Ins — in . (20.14) 

For t (or u), which is negative, we have instead 

ln(— t) = ln|f| , 

Ini = In \t\ + in . 


(20.15) 



20: Two-Particle Elastic Scattering at One Loop 


137 



Figure 20.1: The Feynman diagrams contributing to the two-particle elastic 
scattering amplitude; a double line stands for the exact propagator iA (k), 
a circle for the exact three-point vertex Vs(Aq, A’ 2 , k 3 ), and a square for the 
exact four-point vertex V^Aq, & 2 , Aft, Aq). An external line stands for the 
unit residue of the pole at k 2 = —m 2 . 
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For the three-point vertex, we get 


V 3 (s)/<7 = 1 — ia / dF 3 In (-s/m 2 ) + ln(xi.x 2 ) 


= 1 — ia 


In {—s/m 2 ) — 3 


(20.16) 


where the same comments about the appropriate branch apply. 

For the four-point vertex, the integral over the Feynman parameters 
can be done in closed form, with the result 


dF 4 


D 4 (.s,t) 


s + t 


TT 2 + 


ln(s/i) 


= + - [ vr" + 
u 


In (s/t) 


1 2 


where the second line follows from s + t + u = 0. 
Putting all of this together, we have 


% 


1 — loop — 9 


F(s , t, u) + F(t, u, s ) + F(u, s, t ) 


(20.17) 


(20.18) 


where 

F(s,t,u) 


- ( 1 - T h a 


In (—s/m 2 ) + c 


- 2« 


ln(t/u) 


(20.19) 


and c = (67t 2 + iry/3 — 39) /II = 2.33. This is a typical result of a loop 
calculation: the original tree-level amplitude is corrected by powers of log- 
arithms of kinematic variables. 


Problems 


20.1) Verify eq. (20.17). 

20.2) Compute the O(a) correction to the two-particle scattering amplitude 
at threshold , that is, for s = 4?n 2 and t = u = 0, corresponding to 
zero three-momentum for both the incoming and outgoing particles. 
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21 The Quantum Action 

Prerequisite: 19 


In section 19, we saw how to compute (in p 3 theory in d = 6 dimensions) the 
1PI vertex functions V n (ki , . . . , k n ) for n > 4 via the skeleton expansion : 
draw all Feynman diagrams with n external lines that are one-, two-, and 
three- particle irreducible, and compute them using the exact propagator 
A(k 2 ) and three-point vertex function V3(fci, &2, £3). 

We now define the quantum action (or effective action , or quantum 
effective action ) 


7A5 ft- k) f 2 + - n(fc! 


+ £ 

n = 3 


(: 2ir) d x 

}_ _ l jjh (rt^dxd. 

n\ 


(2TT) d 8 d (k\-\- . . . -\-k n ) 


( 2n) d ( 2n) d 

xV„(fci, <£(fci) . . . ^(A 


( 21 . 1 ) 


where <^(fc) = / d d x e~ ikx ip(x) . The quantum action has the property that 
the tree-level Feynman diagrams it generates give the complete scattering 
amplitude of the original theory. 

In this section, we will determine the relationship between r(<yj) and the 
sum of connected diagrams with sources, iW(J), introduced in section 9. 
Recall that W ( J) is related to the path integral 


Z(J) = J Dp exp iS(<p ) T iJ d d xJp 


where S = f d d x C is the action, via 

Z{J) = exp [iW{J)\ . 
Consider now the path integral 

Zp(J) = j Vp exp iT(p) + */ d d xJp 


( 21 . 2 ) 


(21.3) 


(21.4) 


= exp[iWr(J)] . 


(21.5) 


Wr(J) is given by the sum of connected diagrams (with sources) in which 
each line represents the exact propagator, and each n-point vertex rep- 
resents the exact 1PI vertex V n . Wr(J) would be equal to W(J) if we 
included only tree diagrams in IFp(J). 
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We can isolate the tree-level contribution to a path integral by means of 
the following trick. Introduce a dimensionless parameter that we will call 
h, and the path integral 

Zr,n{J) = J ex P ( r (^) + J d d xJ(p^j (21.6) 

= exp[iWr, R (J)] . (21.7) 

In a given connected diagram with sources, every propagator (including 
those that connect to sources) is multiplied by h. every source by 1 /h, and 
every vertex by 1/h. The overall factor of h is then fi p ^ E ^ v , where V is the 
number of vertices, E is the number of sources (equivalently, the number of 
external lines after we remove the sources), and P is the number of prop- 
agators (external and internal). We next note that P—E—V is equal to 
L— 1, where L is the number of closed loops. This can be seen by counting 
the number of internal momenta and the constraints among them. Specif- 
ically, assign an unfixed momentum to each internal line; there are P—E 
of these momenta. Then the V vertices provide V constraints. One lin- 
ear combination of these constraints gives overall momentum conservation, 
and so does not constrain the internal momenta. Therefore, the number of 
internal momenta left unfixed by the vertex constraints is (P—E)—(V—1), 
and the number of unfixed momenta is the same as the number of loops L. 

So, W r ,n(J) can be expressed as a power series in h of the form 

OO 

W r ,n(J) = E nL ~ l w Tl(J) ■ (21.8) 

L = 0 

If we take the formal limit of h — » 0, the dominant term is the one with 
L = 0, which is given by the sum of tree diagrams only. This is just what 
we want. We conclude that 

W{J) = Wr, L =o(J) • (21.9) 

Next we perform the path integral in eq. (21.6) by the method of station- 
ary phase. We find the point (actually, the held configuration) at which 
the exponent is stationary; this is given by the solution of the quantum 
equation of motion 

= -jm . (2Lio) 

Let <y?j(x) denote the solution of eq. (21.10) with a specified source function 
J(x). Then the stationary-phase approximation to Z-p.n(J) is 

jr ( r (<A/) + Jd d xJtpj^j + O(h 0 ) . 


Zr,n( J ) = ex P 


( 21 . 11 ) 
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Combining the results of eqs. (21.7), (21.8), (21.9), and (21.11), we find 

W(J) = T{tpj) + J d d x Jtpj . (21.12) 


This is the main result of this section. 

Let us explore it further. Recall from section 9 that the vacuum expec- 
tation value of the field operator ip(x) is given by 




(21.13) 


Now consider what we get if we do not set J = 0 after taking the derivative: 




(21.14) 


This is the vacuum expectation value of ip{x) in the presence of a nonzero 
source function J{x). We can get some more information about it by using 
eq. (21.12) for W(J). Making use of the product rule for derivatives, we 
have 

(0|^(x)|0)j = -r^yT(99j) + (^j(x) + J d6 y J (y) 6 £j^ • (21.15) 


We can evaluate the first term on the right-hand side by using the chain 
rule, 



£r(y?j) 6ipj(y) 
5ipj(y) 6J(x) 


(21.16) 


Then we can combine the first and third terms on the right-hand side of 
eq. (21.15) to get 


<0|¥>(x)|0)j 



~ gX^j) 
j(y ) 


+ J(y) 


5yj(y) 

SJ(x) 


+ <pj{x) • 


(21.17) 


Now we note from eq. (21.10) that the factor in large brackets on the right- 
hand side of eq. (21.17) vanishes, and so 


(0^(s)|0)j = ^j(x). (21.18) 


That is, the vacuum expectation value of the field operator (p(x) in the 
presence of a nonzero source function is also the solution to the quantum 
equation of motion, eq. (21.10). 

We can also write the quantum action in terms of a derivative expansion , 


r(¥>) 



U(ip) - + . . . 


(21.19) 
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where the ellipses stand for an infinite number of terms with more and more 
derivatives, and 14 (<p) and Z(tp) are ordinary functions (not functionals) 
of <p(x). U (p) is called the quantum potential (or effective potential , or 
quantum effective potential ), and it plays an important conceptual role in 
theories with spontaneous symmetry breaking; see section 31. However, it 
is rarely necessary to compute it explicitly, except in those cases where we 
are unable to do so. 


Reference Notes 


Construction of the quantum action is discussed in Coleman , Itzykson & 
Zuber , Peskin & Schroeder , and Weinberg II. 

Problems 


21.1) Show that 

r(p) = w(j v ) - J d d x j v p 

where J lf (x) is the solution of 

(5 


6J(x 


- W{J ) = p(x) 


(21.20) 


( 21 . 21 ) 


for a specified p{x). 

21.2) Symmetries of the quantum action. Suppose that we have a set of 
fields p a {x), and that both the classical action S(p) and the integra- 
tion measure Dp are invariant under 

Ta(x) J d d y R ab (x,y)p b (y) (21.22) 

for some particular function R a b(x,y). Typically R a b{x,y ) is a con- 
stant matrix times 5 d (x—y), or a finite number of derivatives of 
5 d (x—y ); see sections 22, 23, and 24 for some examples. 

a) Show that W ( J ) is invariant under 

J a (x) ->• j d d y J b (y)R ba (y,x) . (21.23) 

b) Use eqs. (21.20) and (21.23) to show that the quantum action T(<£>) 
is invariant under eq. (21.22). This is an important result that we will 
use frequently. 
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21.3) Consider performing the path integral in the presence of a background 
field ip(x); we define 


exp [iW(J;f)\ = / XV exp 


iS(ip+ip) + i J d d x Jip 


(21.24) 


Then W(J; 0) is the original W(J) of eq. (21.3). We also define the 
quantum action in the presence of the background field, 

TV; ip) = W(J V ] <P)~ J dd x J<pV , (21.25) 

where J ip ( x) is the solution of 

J ^W{J-,<p) = <p(x) (21.26) 

for a specified tp(x). Show that 

TV; ip) = T (ip+ip; 0) , (21.27) 

where T(<^; 0) is the original quantum action of eq. (21.1). 
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22 Continuous Symmetries and Conserved 

Currents 

Prerequisite: 8 


Suppose we have a set of scalar fields <p a (x), and a lagrangian density 
C(x) = C((p a (x),d l j,ip a (x)). Consider what happens to C{x) if we make 
an infinitesimal change <p a (x) — ► q? a (x) + 8<Pa(x) in each field. We have 
C(x) — > C{x) + 6C(x), where 5C(x) is given by the chain rule, 

{£(I) = + 9(9^W) ' <22 ' 1) 

Next consider the classical equations of motion (also known as the Euler- 
Lagrange equations, or the field equations ), given by the action principle 


_SS_ 

S(p a (x) 


= 0, 


( 22 . 2 ) 


where S = / dry C{y) is the action, and 5/5ip a {x) is a functional derivative. 
(For definiteness, we work in four spacetime dimensions, though our re- 
sults will apply in any number.) We have (with repeated indices implicitly 
summed) 


dpa{X) J 0<p a (x) 



9C{y) 6ip b (y) dC{y) 5{d^ b {y)) 

d (pb{y) dcp a (x) d(d^(p b (y)) 6 <p a (x) 


= jd% 
dc(x 


9£(y) r ,4, \ , dC (y) Jf n ^ 

ObaO {y-x) + ObaO/xd {y-x) 


d<Pb(y) 


d{d^b(y)) 


,, dC[x) 

d<p a (x) M d{d^ a {x)) ' 


We can use this result to make the replacement 


8C{x) 


d „ 


dC{x) 


+ 


5S 


dip a (x) M d(dfj,ip a (x)) ' 5ip a (x) 
in eq. (22.1). Then, combining two of the terms, we get 


5C{x) = d, 


l dC{x) 

\d(d^a(x)) 


$<Pa(x) + 


55 


S<p a (x[ 


-5(f a {x) 


(22.3) 


(22.4) 


(22.5) 
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Next we identify the object in large parentheses in eq. (22.5) as the Noether 
current 


f(x) 


dC{x) 
d(d nVa(x)) 


S<Pa(x ) • 


( 22 . 6 ) 


Eq. (22.5) then implies 


d^fix) = SC(x) 


5S 

5ip a (x) 


$<Pa{x) • 


(22.7) 


If the classical field equations are satisfied, then the second term on the 
right-hand side of eq. (22.7) vanishes. 

The Noether current plays a special role if we can find a set of in- 
finitesimal field transformations that leaves the lagrangian unchanged, or 
invariant. In this case, we have 5L = 0, and we say that the lagrangian has 
a continuous symmetry. From eq. (22.7), we then have = 0 whenever 
the field equations are satisfied, and we say that the Noether current is 
conserved. In terms of its space and time components, this means that 

^j°(x) + V • j(x) = 0 . (22.8) 


If we interpret j°(x) as a charge density , and j(x) as the corresponding cur- 
rent density , then eq. (22.8) expresses the local conservation of this charge. 

Let us see an example of this. Consider a theory of a complex scalar 
field with lagrangian 


C = -d^d^ — m 2 ip*(p — \X (<£>V) 2 . (22.9) 


We can also rewrite C in terms of two real scalar fields by setting ip = 
(</?i + ip 2 ) A/2 to get 

C = -\dVwid^ l -\dVw2d^2-\m 2 {v\ + vl)--k X &\ + l pl) 2 ■ (22.10) 


In the form of eq. (22.9), it is obvious that C is left invariant by the trans- 
formation 


ip(x) e ia y(x) , 


( 22 . 11 ) 


where a is a real number. This is called a U(l) transformation , a transfor- 
mation by a unitary lxl matrix. In terms of (pi and , this transformation 
reads 


/ 

\<P2(x)J 


cos a sm a 


> i(x) 


— sm a cos a , 


.<P2(x) 


( 22 . 12 ) 


If we think of (^ 1 ,(^ 2 ) as a two-component vector, then eq. (22.12) is just 
a rotation of this vector in the plane by angle a. Eq. (22.12) is called an 
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SO (2) transformation , a transformation by an orthogonal 2x2 matrix with 
a special value of the determinant (namely +1, as opposed to — 1, the only 
other possibility for an orthogonal matrix). We have learned that a U(l) 
transformation can be mapped into an S0(2) transformation. 

The infinitesimal form of eq. (22.11) is 

tp(x) — > <p(x) — iaip(x) , 

<p\x) — > p\x) + iotip 1 (x) , (22.13) 

where a is now infinitesimal. In eq. (22.6), we should treat p and ip' as in- 
dependent fields. It is also conventional to scale the infinitesimal parameter 
out of the current, so that we have 

dc . dc 

aJ d(d„(p) ip+ d(d fJ ,p t) 

= (— d^(p*)(— icup) + (— d fJ ’ip)(+iap ! ) 

= aim (ip^d^ip) , (22.14) 

where Ad ,J B = Ad^B — (d^ l A)B. Canceling out a, we find that the Noether 
current is 

j -/i = Im (ip^d^p) . (22.15) 

We can also repeat this exercise using the SO(2) form of the trans- 
formation. For infinitesimal a, eq. (22.12) becomes 8p\ = +ap 2 and 
Sip 2 = —oap\. Then the Noether current is given by 

dC . d£ . 

aj d(d^p i) + d(d^p 2 ) 992 

= (-d^pi)(+ap 2 ) + (-d A V 2)(-atpi) 

= a(p 1 d^p 2 ), (22.16) 

which is (hearteningly) equivalent to eq. (22.14). 

Let us define the Noether charge 

Q = J d 3 xj°(x ) = J d 3 x , (22.17) 

and investigate its properties. If we integrate eq. (22.8) over d 3 x , use Gauss’s 
law to write the volume integral of V • j as a surface integral, and assume 
that the boundary conditions at infinity fix j(x) = 0 on that surface, then 
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we find that Q is constant in time. To get a better idea of the physical 
implications of this, let us rewrite Q using the free-held expansions 

cp(x) = fdk [a(k)e ikx + b*(k)e~ ikx ] , 

v\x) = J dk[b(k)e ikx + a*(k)e~ ikx \ . (22.18) 

We have written a*( k) and 6*( k) rather than a^(k) and 6^(k) because so 
far our discussion has been about the classical held theory. In a theory 
with interactions, these formulae (and their hrst time derivatives) are valid 
at any one particular time (say, t = — oo). Then, we can plug them into 
eq. (22.17), and hnd (after some manipulation similar to what we did for 
the hamiltonian in section 3) 

Q = j dk [o*(k)a(k) - 6(k)6*(k)] . (22.19) 

In the quantum theory, this becomes an operator that counts the number 
of a particles minus the number of b particles. This number is then time- 
independent, and so the scattering amplitude vanishes identically for any 
process that changes the value of Q. This can be seen directly from the 
Feynman rules, which conserve Q at every vertex. 

To better understand the implications of the Noether current in the 
quantum theory, we begin by considering the path integral, 

Z(J) = J Vipe i[S+ I d4yJaipa] . (22.20) 

The value of Z(J) is unchanged if we make the change of variable ip a {x) — > 
( p a {x ) + 5<p a (x), with 5(p a (x) an arbitrary infinitesimal shift that (we as- 
sume) leaves the measure T>: p invariant. Thus we have 

0 = SZ(J) 

= i J d 4 x + <W) 8ip a {x) . (22.21) 

We can now take n functional derivatives with respect to J aj (xj), and then 
set J = 0, to get 

/ f ss 

Vipe lS J d A x i Va 1 (x 1 )...<p an {x n ) 

n 

+ T’ai^l) • • • S aaj 6 4 (x-Xj) . . . (Pa n ( X n ) 

3 = 1 


5ip a {x). ( 22 . 22 ) 
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Since 5tp a (x) is arbitrary, we can drop it (and the integral over d 4 x). Then, 
since the path integral computes the vacuum expectation value of the time- 
ordered product, we have 

s s 

0 = TT < Au( a h) • • • ValXn) |0) 

0(Pa{x) 

n 

+ ^(0|T<^ ai (a;i) . . . 5 aaj 5 4 (x-Xj) . . . (pajx n ) |0) . (22.23) 

3 = i 

These are the Schwinger-Dyson equations for the theory. 

To get a feel for them, let us look at free-held theory for a single real 
scalar held, for which 5S/5<p(x) = (d 2 — m 2 )^>{x). For n = 1 we get 

(— d 2 + m 2 )i(0|T99(x)< / ?(xi)|0) = 5 4 (x—x i) . (22.24) 

That the Klein-Gordon wave operator should sit outside the time-ordered 
product (and hence act on the time-ordering step functions) is clear from 
the path integral form of eq. (22.22). We see from eq. (22.24) that the free- 
held propagator, A(x— x{) = i(0|T(^(x)<^(a;i)|0), is a Green’s function for 
the Klein-Gordon wave operator, a fact we first learned in section 8. 

More generally, we can write 

s s 

(0|T- — <p ai {xi) ■ ■ • <AijTn)|0) = 0 for x ^ x\ n . (22.25) 

d:p a {x) 

We see that the classical equation of motion is satished by a quantum held 
inside a correlation function, as long as its spacetime argument differs from 
those of all the other helds. When this is not the case, we get extra contact 
terms. 

Let us now consider a theory that has a continuous symmetry and a 
corresponding Noether current. Take eq. (22.22), and set 5(p a (x) to be the 
infinitesimal change in y> a {x) that results in 5C(x) = 0. Now sum over the 
index a, and use eq. (22.7). The result is the Ward (or Ward-Takahashi ) 
identity 

0 = d^\Tf(x)y ai (xi) . . . ip an ( z„)|0) 
n 

+ i ^(0|T<£> ai (.Ti) . . . 5ip aj (x)5 4 ( x-Xj) . . . <^ a „(.T n )|0) . (22.26) 

j=i 

Thus, conservation of the Noether current holds in the quantum theory, 
with the current inside a correlation function, up to contact terms with a 
specific form that depends on the details of the infinitesimal transformation 
that leaves £ invariant. 
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The Noether current is also useful in a slightly more general context. 
Suppose we have a transformation of the fields such that 5C(x) is not zero, 
but instead is a total divergence: 6C(x) = for some K^(x). Then 

there is still a conserved current, now given by 

fix) = dC[x) 5Mx) - K»(x) . (22.27) 

d{d^ip a [x)) 

An example of this is provided by the symmetry of spacetime translations. 
We transform the fields via tp a {x ) — > ip a (x — a), where a ^ is a constant four- 
vector. The infinitesimal version of this is tp a {x) Va{ x ) — a u d u ^> a (x), and 
so we have Sip a (x) = —a v d v y> a (x). Under this transformation, we obviously 
have C(x) —> C(x — a), and so 5C(x) = — a v d v L{x) = — d v {a v C{x)). Thus 
in this case K v {x) = —a u C(x), and the conserved current is 

nx) = m£ 

= a v T^(x) , (22.28) 

where we have defined the stress-energy or energy-momentum tensor 

T^(x) EE - dC{x) d\ a {x) + <TC{x) . (22.29) 

d{d^<Pa{X)) 

For a renormalizable theory of a set of real scalar fields <p a (x), the 
lagrangian takes the form 



C = -V(<p) , 

(22.30) 

where V{ip) is 

a polynomial in the (p a ’s- In this case 



T ,JU = di'pad^a + g^C . 

(22.31) 

In particular, 

T °° = k U l + l(V^ 0 ) 2 + V((p) , 

(22.32) 


where II a = do<p a is the canonical momentum conjugate to the field ip a - 
We recognize T 00 as the Hamiltonian density H that corresponds to the 
lagrangian density of eq. (22.30). Then, by Lorentz symmetry, T 0j must be 
the corresponding momentum density. We have 

T 0j = dPtpa&tpa = -n a v j cp a . (22.33) 

To check that this is a sensible result, we use the free- field expansion for 
a set of real scalar fields [the same as eq. (22.18) but with 6(k) = a(k) for 
each field]; then we find that the momentum operator is given by 

= j d?xT°\x) = I dk y a£(k)a 0 (k) , 


(22.34) 
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which is just what we would expect. We therefore identify the energy- 
momentum four-vector as 


P p = J d 3 xT 0p (x) . (22.35) 

Recall that in section 2 we defined the spacetime translation operator 
as 

T(a) = exp(— iP p a p ) , (22.36) 

and announced that it had the property that 

T(a)- l ip a (x)T(a) = ip a (x - a) . (22.37) 

Now that we have an explicit formula for P p , we can check this. This is 
easiest to do for infinitesimal a then eq. (22.37) becomes 

[M*),P tl ] = ?d»<Pa( x). (22.38) 

This can indeed be verified by using the canonical commutation relations 
for ip a (x) and n a (x). 

One more symmetry we can investigate is Lorentz symmetry. If we make 
an infinitesimal Lorentz transformation, we have ip a (x ) — > ip a (x + 5 lo ■ x), 
where buj-x is shorthand for Su u p x p . This case is very similar to that of 
spacetime translations; the only difference is that the translation parameter 
a v is now x dependent, a u — » — 6lo u p x p . The resulting conserved current is 

M pvp [x) = x v T pp {x) - x p T pv {x) , (22.39) 

and it obeys d p M pup = 0, with the derivative contracted with the first 
index. Jv[ pup is antisymmetric on its second two indices; this comes about 
because Sco up is antisymmetric. The conserved charges associated with this 
current are 

M vp = J d 3 xM 0up (x ) , (22.40) 

and these are the generators of the Lorentz group that were introduced 
in section 2. Again, we can use the canonical commutation relations for 
the fields to check that the Lorentz generators have the right commutation 
relations, both with the fields and with each other. 

Reference Notes 

The path-integral approach to Ward identities is treated in more de- 
tail in Peskin & Schroeder. An operator-based derivation can be found in 
Weinberg I. 
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Problems 

22.1) For the Noether current of eq. (22.6), and assuming that 5ip a does not 
involve time derivatives, use the canonical commutation relations to 
show that 

\<Pa,Q\=i8<Pa-$ (22.41) 

where Q is the Noether charge. 

22.2) Use the canonical commutation relations to verify eq. (22.38). 

22.3) a) With T^ u given by eq. (22.31), compute the equal-time (x° = y°) 
commutators [T 00 (x), T 00 (y)], [T°*(x), T 00 (y)], and [T 0l (x),T 0j '(j/)]. 

b) Use your results to verify eqs. (2.17), (2.19), and (2.20). 



23: Discrete Symmetries: P, T, C , and Z 


152 


23 Discrete Symmetries: P, T , C , and Z 

Prerequisite: 22 


In section 2, we studied the proper orthochronous Lorentz transformations, 
which are continuously connected to the identity. In this section, we will 
consider the effects of parity , 


= {v- l Yu 


f +1 \ 

-l 

-l 

V -1/ 


and time reversal, 


ty = (j- x y v 


/-i 

+i 

V 


\ 

+i 

+i/ 


(23.1) 


(23.2) 


We will also consider certain other discrete transformations such as charge 
conjugation. 

Recall from section 2 that for every proper orthochronous Lorentz trans- 
formation A /i „ there is an associated unitary operator U (A) with the prop- 
erty that 

U{ A) - V(x)J7(A) = <p( A"D) . (23.3) 

Thus for parity and time-reversal, we expect that there are corresponding 
unitary operators 


P = U(P) , (23.4) 

T = U{T) , (23.5) 

such that 

P~ l :p{x)P = (p{Vx ) , (23.6) 

T“V(x)T = ¥>(Tx) . (23.7) 

There is, however, an extra possible complication. Since the V and 
T matrices are their own inverses, a second parity or time-reversal trans- 
formation should transform all observables back into themselves. Using 
eqs. (23.6) and (23.7), along with V 2 = 1 and T 2 = 1, we see that 

P~ 2 ip{x)P 2 = ip{x) , 

T- 2 ip(x)T 2 = ip(x) . 


(23.8) 

(23.9) 
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Since c p(x ) is a hermitian operator, it is in principle an observable, and so 
eqs. (23.8) and (23.9) are just what we expect. However, another possibility 
for the parity transformation of the field, different from eqs. (23.6) and 
(23.7) but nevertheless consistent with eqs. (23.8) and (23.9), is 

P~ 1 ^(x)P = —:p(Vx) , (23.10) 

T“ 1 (^(x)T = —(p{Tx) . (23.11) 

This possible extra minus sign cannot arise for proper orthochronous Lorentz 
transformations, because they are continuously connected to the identity, 
and for the identity transformation (that is, no transformation at all), we 
must obviously have the plus sign. 

If the minus sign appears on the right-hand side, we say that the field 
is odd under parity (or time reversal) . If a scalar field is odd under parity, 
we sometimes say that it is a pseudoscalar. 1 

So, how do we know which is right, eqs. (23.6) and (23.7), or eqs. (23.10) 
and (23.11)? The general answer is that we get to choose, but there is a 
key principle to guide our choice: if at all possible, we want to define P 
and T so that the lagrangian density is even, 

P~ l C{x)P = +C(Vx ) , (23.12) 

T- 1 £(x)T = +C(Tx) . (23.13) 

Then, after we integrate over d 4 x to get the action S, the action will be 
invariant. This means that parity and time-reversal are conserved. 

For theories with spin-zero fields only, it is clear that the choice of 
eqs. (23.6) and (23.7) always leads to eqs. (23.12) and (23.13), and so there is 
no reason to flirt with eqs. (23.10) and (23.11). For theories that also include 
spin-one-half fields, certain scalar bilinears in these fields are necessarily odd 
under parity and time reversal, as we will see in section 40. If a scalar field 
couples to such a bilinear, then eqs. (23.12) and (23.13) will hold if and only 
if we choose eqs. (23.10) and (23.11) for that scalar, and so that is what we 
must do. 

There is one more interesting fact about the time-reversal operator T: 
it is antiunitary , rather than unitary. Antiunitary means that T~ l iT = —i. 

To see why this must be the case, consider a Lorentz transformation of 
the energy-momentum four-vector, 

t/(A) -1 P M t/(A) = A » V P V . (23.14) 

Ht is still a scalar under proper orthochronous Lorentz transformations; that is, 
eq. (23.3) still holds. Thus the appellation scalar often means eq. (23.3), and either 
eq. (23.6) or eq. (23.10), and that is how we will use the term. 
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For parity and time-reversal, we therefore expect 

p-ipup = ; (23.15) 

T -i p ti T = T^ U P U . (23.16) 

In particular, for fi = 0, we expect P~ 1 HP = +H and T~ l HT = —H. 
The first of these is fine; it says the hamiltonian is invariant under parity, 
which is what we want. 2 However, eq. (23.16) is a disaster: it says that the 
hamiltonian is invariant under time-reversal if and only if H = —H, which 
is possible only if H = 0. 

Can we just put an extra minus sign on the right-hand side of eq. (23.16), 
as we did for eq. (23.11)? The answer is no. We constructed P^ L explicitly 
in terms of the fields in section 22, and it is easy to check that choosing 
eq. (23.11) for the fields does not yield an extra minus sign in eq. (23.16) 
for the energy-momentum four- vector. 

Let us reconsider the origin of eq. (23.14). We first recall that the space- 
time translation operator 

T(a) = exp(— iP-a) . (23.17) 

(which should not be confused with the time-reversal operator T) trans- 
forms a scalar held according to 

T(a)~ 1 tp(x)T(a) = ip(x — a) . (23.18) 

The spacetime translation operator is a scalar with a spacetime coordinate 
as a label; by analogy with eq. (23.3), we should have 

U (A)~ 1 T(a)U (A) = T(A~ 1 a) . (23.19) 

Now, treat as infinitesimal in eq. (23.19) to get 

U(A)-\l-i ail P»)U{A) = I - z(A _1 ) i /a /t P i ' 

= I - iA^a^-P" . (23.20) 

For time-reversal, this becomes 

T _1 (J - ia ll P l ‘)T = I- i,T fJ ' u a fl P l/ . (23.21) 

If we now identify the coefficients of — ia ^ on each side, we get eq. (23.16), 
which is bad. In order to get the extra minus sign that we need, we must 
impose the antiunitary condition 

T-HT = -i . (23.22) 

2 When spin-one-half fields are present, it may be that no operator exists that satisfies 
either eq. (23.6) or eq. (23.10) and also eq. (23.15); in this case we say that parity is 
explicitly broken. 
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We then find 

T - 1 pn T = —T^ U P U (23.23) 

instead of eq. (23.16). This yields T~ 1 HT = +H, which is the correct 
expression of time-reversal invariance. 

We turn now to other unitary operators that change the signs of scalar 
fields, but do nothing to their spacetime arguments. Suppose we have a 
theory with real scalar fields p a (x), and a unitary operator Z that obeys 

Z~ 1 p a (x)Z = r] a (p a (x ) , (23.24) 


where rj a is either +1 or —1 for each field. We will call Z a Z 2 operator, be- 
cause Z 2 is the additive group of the integers modulo 2, which is equivalent 
to the multiplicative group of +1 and —1. This also implies that Z 2 = 1, 
and so Z = Z. (For theories with spin-zero fields only, the same is also 
true of P and T, but things are more subtle for higher spin, as we will see 
in Part II.) 

Consider the theory of a complex scalar field (p = (p\ + ip 2 )/V 2 that 
was introduced in section 22, with lagrangian 

C = —d^p^d^p — m 2 p*p - (<£>V) 2 (23.25) 

= -^pid^p! - \d^p 2 d lx p 2 - \m 2 (p\ + p\) - ygA (p\ + pi) 2 . (23.26) 


In the form of eq. (23.25), C is obviously invariant under the U(l) transfor- 
mation 

(p(x) -> e~ ia p(x) . (23.27) 

In the form of eq. (23.26), C is obviously invariant under the equivalent 
SO(2) transformation, 


/ ¥>i(®)\ 

\<7>2 (x)J 


cos a 
— sin a 


sin a 
cos a 


{ ^i(*)\ 
\<P2(x)J 


(23.28) 


However, it is also obvious that C has an additional discrete symmetry, 


p(x) p'{x) 


(23.29) 


in the form of eq. (23.25), or equivalently 

/ pi(x)\ / +1 0 \ / <^>i(®)\ 

\¥^(*)/ V 0 - 1 / \<P2(x)J 


(23.30) 


in the form of eq. (23.26). This discrete symmetry is called charge conju- 
gation. It always occurs as a companion to a continuous U(l) symmetry. 
In terms of the two real fields, it enlarges the group from SO(2) (the group 
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of 2 x 2 orthogonal matrices with determinant +1) to 0(2) (the group of 
2x2 orthogonal matrices). 

We can implement charge conjugation by means of a particular Z 2 op- 


erator C that obeys 

C 1 <p(x)C = ip\x) , 

(23.31) 

or equivalently 

C~ 1 ipi(x)C = +ipi(x) , 

(23.32) 


C~ 1 tp 2 (x)C = -ip 2 {x) . 

(23.33) 

We then have 

C~ 1 C(x)C = C{x) , 

(23.34) 


and so charge conjugation is a symmetry of the theory. Physically, it implies 
that the scattering amplitudes are unchanged if we exchange all the a-type 
particles (which have charge +1) with all the 5-type particles (which have 
charge —1). This means, in particular, that the a and b particles must have 
exactly the same mass. We say that b is a’s antiparticle. 

More generally, we can also have Z 2 symmetries that are not related 
to antiparticles. Consider, for example, theory, where ip is a real scalar 
field with lagrangian 

C = —\d^ipd^ip — \m 2 <p 2 — jg\ip A ■ (23.35) 

If we define the Z 2 operator Z via 

Z~ 1 (f(x)Z = —ip{x) , (23.36) 

then C is obviously invariant. We therefore have Z~ l HZ = H, or equiva- 
lently [Z. H] = 0, where H is the hamiltonian. If we assume that (as usual) 
the ground state is unique, then, since Z commutes with H , the ground 
state must also be an eigenstate of Z . We can fix the phase of Z [which is 
undetermined by eq. (23.36)] via 

Z|0) = Z _1 |0) = +|0) . (23.37) 

Then, using eqs. (23.36) and (23.37), we have 

(0|p(x)|0> = (0|ZZ"V(x)ZZ -1 |0) 

= — (0|(/?(x)|0) . (23.38) 

Since (0|<^(x)|0) is equal to minus itself, it must be zero. Thus, as long as 
the ground state is unique, the Z 2 symmetry of ip 4 theory guarantees that 
the field has zero vacuum expectation value. We therefore do not need to 
enforce this condition with a counterterm Y ip , as we did in (p 5 theory. (The 
assumption of a unique ground state does not necessarily hold, however, as 
we will see in section 30.) 
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24 Nonabelian Symmetries 

Prerequisite: 22 

Consider the theory (introduced in section 22) of a two real scalar fields ip\ 
and with 

£ = - 20^20/^2 - + £ 2 ) - + £s) 2 • (24.1) 

We can generalize this to the case of N real scalar fields ipi with 

C = —d^ipid^i - \m 2 ywi - {(pm) 2 , (24.2) 

where a repeated index is summed. This lagrangian is clearly invariant 
under the SO(IV) transformation 

<Pi(x) -> Rij<pj(x ) , (24.3) 

where R is an orthogonal matrix with a positive determinant: R T = R~ l , 
det R = +1. This largrangian is also clearly invariant under the Z 2 trans- 
formation cpi(x ) — > — tpi(x), which enlarges SO(IV) to O(IV); see section 
23. However, in this section we will be concerned only with the continuous 
SO (AT) part of the symmetry. 

Next we will need some results from group theory. Consider an infinites- 
imal SO (N) transformation, 

Rij = Sij + 0^ + 0(9 2 ) . (24.4) 

Orthogonality of Rij implies that 0 l3 is real and antisymmetric. It is con- 
venient to express 0 l; j in terms of a basis set of hermitian matrices (T a )jj. 
The index a runs from 1 to ^N(N-l), the number of linearly independent, 
hermitian, antisymmetric, N x N matrices. We can, for example, choose 
each T a to have a single nonzero entry — i above the main diagonal, and 
a corresponding +i below the main diagonal. These matrices obey the 
normalization condition 


Tr (T a T b ) = 25 ab . (24.5) 

In terms of them, we can write 

9 jk = —i9 a (T a ) jk , (24.6) 

where 9 a is a set of 7)N(N— 1) real, infinitesimal parameters. 

The T a, s are the generator matrices of SO(AT). The product of any two 
SO (N) transformations is another SO(N) transformation; this implies (see 
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problem 24.2) that the commutator of any two generator matrices must be 
a linear combination of generator matrices, 

[T a ,T b ] = if abc T c . (24.7) 

The numerical factors f abc are the structure coefficients of the group, and 
eq. (24.7) specifies its Lie algebra. If f abc = 0, the group is abelian. Other- 
wise, it is nonabelian. Thus, U(l) and SO(2) are abelian groups (since they 
each have only one generator that obviously must commute with itself), 
and SO(IV) for N > 3 is nonabelian. 

If we multiply eq. (24.7) on the right by T d , take the trace, and use 
eq. (24.5), we find 

f abd = _±iTr([T a ,T b ]T d ) . (24.8) 

Using the cyclic property of the trace, we find that f abd must be completely 
antisymmetric. Taking the complex conjugate of eq. (24.8) (and remember- 
ing that the T a, s are hermitian matrices), we find that f abcl must be real. 

The simplest nonabelian group is SO(3). In this case, we can choose 
(' T a )ij = — where e^ k is the completely antisymmetric Levi-Civita 
symbol, with e 123 = +1. The commutation relations become 

[T a ,T b ] = ie abc T c . (24.9) 

That is, the structure coefficients of SO(3) are given by f abc = e abc . 

Consider now a theory with N complex scalar fields cpi, and a lagrangian 

C = -cU<4 d^i - m 2 (pl<pi - ±A (<p\<pi) 2 , (24.10) 

where a repeated index is summed. This lagrangian is clearly invariant 
under the U(1V) transformation 

<Pi(x) Uijipj(x ) , (24.11) 

where U is a unitary matrix: W = U~ 1 . We can write U\j = e~ i8 Uij, 
where 6 is a real parameter and det U VJ = +1; Uij is called a special unitary 
matrix. Clearly the product of two special unitary matrices is another 
special unitary matrix; the N X N special unitary matrices form the group 
SU(IV). The group U(IV) is the direct product of the group U(l) and the 
group we write U(IV) = U(l) x SU(A' r ). 

Consider an infinitesimal SU(IV) transformation, 

Uij = Sij - i9 a (T a )ij + 0(9 2 ) , (24.12) 

where 9 a is a set of real, infinitesimal parameters. Unitarity of U implies 
that the generator matrices T a are hermitian, and det U = +1 implies 
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that each T a is traceless. (This follows from the general matrix formula 
lndet A = Trln A) The index a runs from 1 to IV 2 — 1, the number of lin- 
early independent, hermitian, traceless, N x N matrices. We can choose 
these matrices to obey the normalization condition of eq. (24.5). For SU(2), 
the generators can be chosen to be the Pauli matrices; the structure coeffi- 
cients of SU(2) then turn out to be f abc = 2e abc , the same as those of SO(3), 
up to an irrelevant overall factor [which could be removed by changing the 
numerical factor on right-hand side of eq. (24.5) from 2 to ?,] . 

For SU(IV), we can choose the T a ' s in the following way. First, there 
are the SO (IV) generators, with one —i above the main diagonal a corre- 
sponding +i below; there are ^N(N-l) of these. Next, we get another set 
by putting one +1 above the main diagonal and a corresponding +1 below; 
there are ^N(N-l) of these. Finally, there are diagonal matrices with n 
l’s along the main diagonal, followed a single entry — n, followed by zeros 
[times an overall normalization constant to enforce eq. (24.5)]; the are N — 1 
of these. The total is IV 2 — 1, as required. 

However, if we examine the lagrangian of eq. (24.10) more closely, we 
find that it is actually invariant under a larger symmetry group, namely 
SO(2 N). To see this, write each complex scalar field in terms of two real 
scalar fields, (pj = (tpji + ipj ?) Then 

Vj = |(Pii + <£12 + • • • + Pni + Pm) ■ (24.13) 

Thus, we have 2 N real scalar fields that enter C symmetrically, and so 
the actual symmetry group of eq. (24.10) is SO(2IV), rather than just the 
obvious subgroup U(JV). 

We will, however, meet the SU (IV) groups again in Parts II and III, 
where they will play a more important role. 

Problems 

24.1) Show that 0 t] in eq. (24.4) must be antisymmetric if R is orthogonal. 

24.2) By considering the SO (IV) transformation R'~ 1 R~ 1 R'R, where R 
and R' are independent infinitesimal SO (IV) transformations, prove 
eq. (24.7). 

24.3) a) Find the Noether current j afl for the transformation of eq. (24.6). 

b) Show that [pi,Q a ] = ( T a )ijipj , where Q a is the Noether charge. 

c) Use this result, eq. (24.7), and the Jacobi identity (see problem 2.8) 
to show that [Q a ,Q b ] = if abc Q c . 
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24.4) The elements of the group SO (N) can be defined as N x N matrices 
R that satisfy 

Rii' Rjj/Si' ji = Sij . (24.14) 

The elements of the symplectic group Sp(2iV) can be defined as 2N x 
2N matrices S that satisfy 


Sii' SjjiTji'ji 7]ij , (24.15) 

where the symplectic metric ipj is antisymmetric, ry-j = —rjji, and 
squares to minus the identity: r/ 2 = —I. One way to write r/ is 

o)’ (2416) 

where / is the N x N identity matrix. Find the number of generators 
of Sp(2 N). 
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25 Unstable Particles and Resonances 

Prerequisite: 14 

Consider a theory of two real scalar fields, <p and Xi with lagrangian 

C = ~^d ll ipd tJ <p - \m\g> 2 - "" h m lx 2 + . (25.1) 

This theory is renormalizable in six dimensions, where g and h are dimen- 
sionless coupling constants. 

Let us assume that m, P > 2 m x . Then it is kinematically possible for 
the (p particle to decay into two x particles. The amplitude for this process 
is given at tree level by the Feynman diagram of fig. (25.1), and is simply 
T = g. We can also choose to define g as the value of the exact <px 2 vertex 
function \:s(k, k \ , k 2 ) when all three particles are on shell: k 2 = — m 2 , 
k'i = k'^ = This implies that 

T = g (25.2) 

exactly. 

According to the formulae of section 11, the differential decay rate (in 
the rest frame of the initial p> particle) is 

dr = — !— dLIPS 2 \T\ 2 , (25.3) 

where dLIPS 2 is the Lorentz invariant phase space differential for two out- 
going particles, introduced in section 11. We must make a slight adaptation 
for six dimensions: 


dLIPS 2 = (2tt) g ( k'l+k^—k) dk\ dk' 2 . (25.4) 


Here k = 0 ) is the energy- momentum of the decaying particle, and 


dk 


d 5 k 

(2tt) 5 2w 


(25.5) 


is the Lorentz-invariant phase-space differential for one particle. Recall that 
we can also write it as 

,fir 

dk = ' 6 27t 5{k 2 + m 2 ) 9(k°) , (25.6) 

(27T) u a 

where 0{x) is the unit step function. Performing the integral over k° turns 
eq. (25.6) into eq. (25.5). 
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Figure 25.1: The tree-level Feynman diagram for the decay of a (p particle 
(dashed line) into two x particles (solid lines). 

Repeating for six dimensions what we did in section 11 for four dimen- 
sions, we find 

" LIPS2 =4(^ rffi - < 257 > 

where | k / ] | = ^(jb 2 — 4 m 2 ) 1 / 2 is the magnitude of the spatial momentum 
of one of the outgoing particles. We can now plug this into eq. (25.3), and 
use f dfl = fl ,5 = 27 r 5 / 2 /r(|) = |7r 2 . We also need to divide by a symmetry 
factor of two, due to the presence of two identical particles in the final state. 
The result is 

= ^7ra(l - 4m 2 /m^) 3/2 , 

where a = g 2 /( 47r) 3 . 

However, as we discussed in section 11, we have a conceptual problem. 
According to our development of the LSZ formula in section 5, each incom- 
ing and outgoing particle should correspond to a single-particle state that 
is an exact eigenstate of the exact hanriltonian. This is clearly not the case 
for a particle that can decay. 

Let us, then, compute something else instead: the correction to the cp 
propagator from a loop of x particles, as shown in fig. (25.2). The diagram 
is the same as the one we already analyzed in section 14, except that the 
internal propagators contain m x instead of rri v . (There is also a contri- 
bution from a loop of <p particles, but we can ignore it if we assume that 
h < g.) We have 

n (k 2 ) = \a dx DlnD - A'k 2 - B'm % , (25.10) 


(25.8) 

(25.9) 


where 


D = x{l— x)k 2 + m x — ie 


(25.11) 
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Figure 25.2: A loop of x particles correcting the (p propagator. 

and A! and B' are the finite counterterm coefficients that remain after the 
infinities have been absorbed. We now try to fix A' and B' by imposing 
the usual on-shell conditions II(— fn 2 ) = 0 and IT(— m 2 ) = 0. 

But, we have a problem. For k 2 = — m 2 and m^ > 2 m x , D is negative 
for part of the range of x. Therefore In D has an imaginary part. This 
imaginary part cannot be canceled by A' and B' , since A' and B' must be 
real: they are coefficients of hermitian operators in the lagrangian. The 
best we can do is ReII(— m 2 ) = 0 and Re II '(— m 2 ) = 0. Imposing these 
gives 

II(A: 2 ) = \ol f dx D ln(D/\Do\) — ^cc(/c 2 + m 2 ) , (25.12) 

J 0 

where 

Dq = —x(l — x)m 2 + m 2 . (25.13) 

Now let us compute the imaginary part of II(/c 2 ). This arises from the 
integration range X- < x < x + , where x± = ^ ± ^(1 + m 2 /^ 2 ) 1 / 2 are 
the roots of D = 0 when k 2 < —4m 2 . In this range, Iiri In D = — Z7r; the 
minus sign arises because, according to eq. (25.11), D has a small negative 
imaginary part. Now we have 

Im U(k 2 ) = —\n a I dx D 

J X- 

= -^7ra(l + 4 m 2 /fc 2 ) 3/2 k 2 (25.14) 

when k 2 < —4m 2 . Evaluating eq. (25.14) at k 2 = — m 2 , we get 

ImII(— m 2 ) = j^7ra(l - 4 m 2 /m 2 ) 3//2 m 2 . (25.15) 

From this and eq. (25.9), we see that 

ImII(— m 2 ) = rritpT . (25.16) 

This is not an accident. Instead, it is a general rule. We will argue 
this in two ways: first, from the mathematics of Feynman diagrams, and 
second, from the physics of resonant scattering in quantum mechanics. 
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We begin with the mathematics of Feynman diagrams. Return to the 
diagrammatic expression for n(£: 2 ), before we evaluated any of the integrals: 

n 0 2 ) = -¥a 2 J 2TT) 6 5 6 (h+£ 2 -k) 

1 l 

X 

l\ + m 2 — ie i I + m 2 — ie 

-( Ak 2 + Bml ). (25.17) 

Here, for later convenience, we have assigned the internal lines momenta 
t\ and £ 2 , and explicitly included the momentum-conserving delta function 
that fixes one of them. We can take the imaginary part of n(/c 2 ) by using 
the identity 

= P — h Z7T 5(x) , (25.18) 

x — ie x 

where P means the principal part. We then get, in a shorthand notation, 

Im no 2 ) = -\a 2 J (t 3 i P‘ 2 . - 7r 2 <5i<5 2 ) • (25.19) 

Next, we notice that the integral in eq. (25.17) is the Fourier transform 
of [A (x—y)] 2 , where 


A (x-y) = J 


d 6 k 

(27 r) 6 k 2 + m 2 — ie 


(25.20) 


is the Feynman propagator. Recall (from problem 8.5) that we can get the 
retarded or advanced propagator (rather than the Feynman propagator) 
by replacing the e in eq. (25.20) with, respectively, — se or +se, where s = 
signO°). Therefore, in eq. (25.19), replacing <5i with — si<5i and <5 2 with 
+52^2 yields an integral that is the real part of the Fourier transform of 
A re tO— y)A a( j v O— y). But this product is zero, because the first factor 
vanishes when x° > y°, and the second when x° < y°. So we can subtract 
the modified integrand from the original without changing the value of the 
integral. Thus we have 


Imn(/c 2 ) = ^g 2 7r 2 J (1 + sis 2 )(5i<5 2 . 


(25.21) 


The factor of l + sis 2 vanishes if and have opposite signs, and equals 2 
if they have the same sign. Because the delta function in eq. (25.17) enforces 
^1 + ^2 = &°> and k° = m v is positive, both and 6^ must be positive. 
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So we can replace the factor of 1 + si «2 in eq. (25.21) with 20(£5)#(^2)- 
Rearranging the numerical factors, we have 

Imn ( fc 2 ) = \pj lie (0(2 *me l+ e 2 -k) 

x 2tt 8{i\ + ni^)d(li) 2ir5(£l + rrv^)d{^) . (25.22) 

If we now set k 2 = —m^, use eqs. (25.4) and (25.6), and recall that T = g 
is the decay amplitude, we can rewrite eq. (25.22) as 

Imn(-mJ) = \J dLIPS 2 \T\ 2 . (25.23) 


Comparing eqs. (25.8) and (25.23), we see that we indeed have 

ImII(— m^p) = m, p T . (25.24) 


This relation persists at higher orders in perturbation theory. Our anal- 
ysis can be generalized to give the Cutkosky rules for computing the imag- 
inary part of any Feynman diagram, but this is beyond the scope of our 
current interest. 

To get a more physical understanding of this result, recall that in non- 
relativistic quantum mechanics, a metastable state with energy Eq and 
angular momentum quantum number t shows up as a resonance in the 
partial-wave scattering amplitude, 


ME) ~ 


1 

E — Eq + iT/2 


(25.25) 


If we imagine convolving this amplitude with a wave packet ip(E)e lEt , we 
will find a time dependence 


Mt) 


dE 


E — Eq + iT /2 


V>C E)e 


—iEt 


^ e -iE 0 t-Tt/2 ' 


(25.26) 


and we identify T as the inverse lifetime of the 


Therefore \i/>(t)\ 2 ~ e~ rt , 
metastable state. 

In the relativistic case, consider the scattering process XX " XX- The 
contributing diagrams from the effective action are those of fig. (20.1), 
where the exact internal propagator can be either cp or y. Suppose that 
the center-of-mass energy squared s is close to m p . Since the <p progator 
has a pole near s = rn p . .s-channel ip exchange, shown in fig. (25.3), makes 
the dominant contribution to the XX scattering amplitude. We then have 


T 


9 


—s + ni 2 , - II(— s) 


(25.27) 
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Figure 25.3: For s near the dominant contribution to XX scattering is 
s-channel ip exchange. 


Here we have used the fact that the exact (fXX vertex has the value g when 
all three particles are on-shell. Now let us write 


s = (m^ + ef ~ + 2m (p e , 


(25.28) 


where e <C rrim is the amount of energy by which our incoming particles are 
off resonance. We find 


r ~ 


-g' 2 /2m lf 

e + U(-rri* > )/2m ip ' 


(25.29) 


Recalling that Ren (— nrify = 0, and comparing with eq. (25.25), we see that 
we should make the identification of eq. (25.24). 


Reference Notes 


The Cutkosky rules are discussed in more detail in Peskin & Schroeder. 
More details on resonances can be found in Weinberg I. 
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26 Infrared Divergences 

Prerequisite: 20 


In section 20, we computed the (pip — > (pip scattering amplitude in ip 3 theory 
in six dimensions in the high-energy limit (s, |t|, and |u| all much larger 
than to 2 ). We found that 


T = Tn 


1 - 


a 


(in (s/to 2 ) + 0(m°)^ + 0(a 2 ) 


(26.1) 


where To = —g 2 (s^ 1 + f -1 + if” 1 ) is the tree-level result, and the 0(m°) 
term includes everything without a large logarithm that blows up in the 
limit to — > 0. 1 

Suppose we are interested in the limit of massless particles. The large 
log is then problematic, since it blows up in this limit. What does this 
mean? 

It means we have made a mistake. Actually, two mistakes. In this 
section, we will remedy one of them. 

Throughout the physical sciences, it is necessary to make various ide- 
alizations in order to make progress. (Recall the “massless springs” and 
“frictionless planes” of freshman mechanics.) Sometimes these idealiza- 
tions can lead us into trouble, and that is one of the things that has gone 
wrong here. 

We have assumed that we can isolate individual particles. The reasoning 
behind this was explained in section 5, and it depends on the existence of an 
energy gap between the one-particle states and the multiparticle continuum. 
However, this gap vanishes if the theory includes massless particles. In this 
case, it is possible that the scattering process involved the creation of some 
extra very low energy (or soft) particles that escaped detection. Or, there 
may have been some extra soft particles hiding in the initial state that 
discreetly participated in the scattering process. Or, what was seen as a 
single high-energy particle may actually have been two or more particles 
that were moving colinearly and sharing the energy. 

Let us, then, correct our idealization of a perfect detector and account 
for these possibilities. We will work with p 3 theory, initially in d spacetime 
dimensions. 

Let T be the amplitude for some scattering process in <p 3 theory. Now 
consider the possibility that one of the outgoing particles in this process 
splits into two, as shown in fig. (26.1). The amplitude for this new process 

Til writing T in this form, we have traded factors of Inf and In it for Ins by first using 
Inf = Ins + ln(f/s), and then hiding the ln(f/s) terms in the O(m 0 ) catchall. 
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Figure 26.1: An outgoing particle splits into two. The gray circle stands 
for the sum of all diagrams contributing to the original amplitude iT. 

is given in terms of T by 

^piit = ig 

where k = k\ + & 2 , and k\ and k^ are the on-shell four-momenta of the 
two particles produced by the split. (For notational convenience, we drop 
our usual primes on the outgoing momenta.) The key point is this: in the 
massless limit, it is possible for 1 /{k 2 + m 2 ) to diverge. 

To understand the physical consequences of this possibility, we should 
compute an appropriate cross-section. To get the cross section for the 
original process (without the split), we multiply \T\ 2 by dk (as well as by 
similar differentials for other outgoing particles, and by an overall energy- 
momentum delta function). For the process with the split, we multiply 
l^spiitl 2 by \dk\dk -2 instead of dk. (The factor of one-half is for counting 
of identical particles.) If we assume that (due to some imperfection) our 
detector cannot tell whether or not the one particle actually split into two, 
then we should (according to the usual rules of quantum mechanics) add 
the probabilities for the two events, which are distinguishable in principle. 
We can therefore define an effectively observable squared-amplitude via 

\T\ 2 ohs dk = |Tj 2 dk + |7^ plit | 2 \dk 1 dk 2 + . . . . (26.3) 

Here the ellipses stand for all other similar processes involving emission of 
one or more extra particles in the final state, or absorption of one or more 
extra particles in the initial state. 

We can simplify eq. (26.3) by including a factor of 

1 = (2vr) d - 1 2w $ d " 1 (ki+k 2 -k)d)fe 


m 


■r, 


(26.2) 


(26.4) 
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in the second term. Now all terms in eq. (26.3) include a factor of dk , so 
we can drop it. Then, using eq. (26.2), we get 


T 


2 _ 
lobs — 


T I 2 


1 + 


(fc2 + m 2y ( 2?r ) d 1 (k i +k 2 -k) \dk x dk 2 + . . . . 


(26.5) 

Now we come to the point: in the massless limit, the phase space integral 
in the second term in eq. (26.5) can diverge. This is because, for m = 0, 

k 2 = ( k\ + fc 2 ) 2 = — 4(ji(j 2 sin 2 (6 , /2) , (26.6) 


where 0 is the angle between the spatial momenta ki and k 2 , and uq j2 = 
|ki j2 |. Also, for m = 0, 

dk\dk 2 ~ (u;f -3 dull) (w 2 ~ 3 dui 2 ) (sin d-3 6 dd ) . (26.7) 


Therefore, for small 9, 


dk\dk2 duii dui2 dd 
0 k 2 ) 2 ~ ' 


(26.8) 


Thus the integral over each ui diverges at the low end for d < 4, and the 
integral over 9 diverges at the low end for d < 6. These divergent integrals 
would be cut off (and rendered finite) if we kept the mass m nonzero, as 
we will see below. 

Our discussion leads us to expect that the m — > 0 divergence in the 
second term of eq. (26.5) should cancel the m — > 0 divergence in the loop 
correction to |T| 2 . We will now see how this works (or fails to work) in detail 
for the familiar case of two-particle scattering in six spacetime dimensions, 
where T is given by eq. (26.1). For d = 6, there is no problem with soft 
particles (corresponding to the small-(j divergence), but there is a problem 
with collinear particles (corresponding to the small-0 divergence). 

Let us assume that our imperfect detector cannot tell one particle from 
two nearly collinear particles if the angle 9 between their spatial momenta 
is less than some small angle 5. Since we ultimately want to take the m — ► 0 
limit, we will evaluate eq. (26.5) with m 2 /k 2 <C 5 2 <C 1. 

We can immediately integrate over d 5 fc 2 using the delta function, which 
results in setting k 2 = k — ki everywhere. Let (3 then be the angle between 
ki (which is still to be integrated over) and k (which is fixed). For two- 
particle scattering, |kj = 4^/s in the limit m — > 0. We then have 


(27t) 5 2 w 5 5 (ki+k 2 — k) ^dkidk2 


12 4 Ul 
4(27t) 5 uqu; 2 


ki| 4 d|ki| sin 3 /? d/3 , 

(26.9) 



26: Infrared Divergences 


170 


where H4 = 27t 2 is the area of the unit four-sphere. Now let 7 be the angle 
between k2 and k. The geometry of this trio of vectors implies 6 = [3 + 7, 
|ki| = (sin7/sin#)|k|, and |k2 1 = (sin /3/sin 0)|k|. All three of the angles are 
small and positive, and it then is useful to write /3 = xO and 7 = (1 —x)6, 
with 0 < x < 1 and 0<5<1. 

In the low mass limit, we can safely set m = 0 everywhere in eq. (26.5) 
except in the propagator, 1 /(k 2 + m 2 ). Then, expanding to leading order 
in both 6 and m, we find (after some algebra) 


k 2 + m 2 ~ — x(l— x)k 2 6 2 + (m 2 /k 2 )/(x) 


(26.10) 


where f(x) = (1— x+x 2 )/(x— x 2 ) 2 . Everywhere else in eq. (26.5), we can 
safely set oj\ = |ki| = (1— x)|k| and 0 J 2 = |k2 1 = x|k|. Then, changing the 
integration variables in eq. (26.9) from |ki| and (3 to x and 9, we get 


l^lobs 


= ITI 


1 + 


4 

4(2vr) 


x(l—x)dx 


r s 9 3 dd 

'0 [6 2 + ( ? n2/k2)/(x-)] 2 


Performing the integral over 9 yields 


+ ... . 

(26.11) 


ln((5 2 k 2 /'m 2 ) — ^ In f(x) — \ . (26.12) 

Then, performing the integral over x and using H4 = 27t 2 and a = g 2 / (47r) 3 , 
we get 


T 


2 

obs 


T I 2 


1 + ^a(ln(e> 2 k 2 /?7? 2 ) + cj + ■ ■ ■ , 


(26.13) 


where c = (4 — 3s/3tt)/3 = —4.11. 

The displayed correction term accounts for the possible splitting of one 
of the two outgoing particles. Obviously, there is an identical correction 
for the other outgoing particle. Less obviously (but still true), there is 
an identical correction for each of the two incoming particles. (A glib 
explanation is that we are computing an effective amplitude-squared, and 
this is the same for the reverse process, with in and outgoing particles 
switched. So in and out particles should be treated symmetrically.) Then, 
since we have a total of four in and out particles (before accounting for any 
splitting), 


T 


I 2 

lobs 


m 2 


1 + /|a^ln(<5 2 k 2 /?n 2 ) + cj + 0(a 2 ) . 


(26.14) 


We have now accounted for the 0(a) corrections due to the failure of 
our detector to separate two particles whose spatial momenta are nearly 
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parallel. Combining this with eq. (26.1), and recalling that k 2 = we get 

|T| 2 bs = l^l 2 1 ~ ^a(ln(s/m 2 ) + O(m 0 )) + 0(a 2 ) 
x 1 + ^a[\n(5 2 s / m 2 ) + O(m 0 )^j + 0(a 2 ) 

= |7o| 2 1 — a(§ ln(s/m 2 ) + ^ ln(l/5 2 ) + 0(m°)j 

+ 0(a 2 )]. (26.15) 

We now have two kinds of large logs. One is ln(l/e> 2 ); this factor depends 
on the properties of our detector. If we build a very good detector, one 
for which a\n(l/5‘ 2 ) is not small, then we will have to do more work, and 
calculate higher-order corrections to eq. (26.15). 

The other large log is our original nemesis ln(.s/m 2 ). This factor blows 
up in the massless limit. This means that there is still a mistake hidden 
somewhere in our analysis. 

Reference Notes 

Infrared divergences in quantum electrodynamics are discussed in Brown 
and Peskin & Schroeder. More general treatments can be found in Sterman 
and Weinberg I. 
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27 Other Renormalization Schemes 

Prerequisite: 26 

To find the remaining mistake in eq. (26.15), we must review our renor- 
malization procedure. Recall our result from section 14 for the one-loop 
correction to the propagator, 

B(k 2 ) = — A + L - B + ^ m 2 

+ ha f dx D ln(D/g 2 ) + 0(a 2 ) , (27.1) 

“ Jo 

where a = g 2 /(4ir) 5 and D = x(\ —x)k 2 +m 2 . The derivative of Ii(k 2 ) with 
respect to k 2 is 

U'(k 2 ) = - A + + 1^ 

+ \ol [ dxx( 1 — x) In (D/ji 2 ) + 1 + 0(a 2 ) . (27.2) 

Jo L J 

We previously determined A and B via the requirements n(— m 2 ) = 0 and 
n' (—in 2 ) = 0. The first condition ensures that the exact propagator A (k 2 ) 
has a pole at k 2 = — m 2 , and the second ensures that the residue of this 
pole is one. Recall that the field must be normalized in this way for the 
validity of the LSZ formula. 

We now consider the massless limit. We have D = x(l— x)k 2 , and we 
should apparently try to impose n(0) = n'(0) = 0. However, n(0) is now 
automatically zero for any values of A and B, while n'(0) is ill defined. 

Physically, the problem is that the one-particle states are no longer sep- 
arated from the multiparticle continuum by a finite gap in energy. Mathe- 
matically, the pole in A (k 2 ) at k 2 = —m 2 merges with the branch point at 
k 2 = —Am 2 , and is no longer a simple pole. 

The only way out of this difficulty is to change the renormalization 
scheme. Let us first see what this means in the case m ^ 0, where we know 
what we are doing. 

Let us try making a different choice of A and B. Specifically, let 

A = —^a-£ + 0(ol 2 ) , 

B = - a A + 0(a 2 ). (27.3) 

Here we have chosen A and B to cancel the infinities, and nothing more; 
we say that A and B have no finite parts. This choice represents a different 
renormalization scheme. Our original choice (which, up until now, we have 
pretended was inescapable!) is called the on- shell or OS scheme. The choice 
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of eq. (27.3) is called the modified minimal- subtraction or MS (pronounced 
“emm-ess-bar” ) scheme. [“Modified” because we introduced g via g — > 
gfi £ / 2 , with fi 2 = 47re _7 /i 2 ; had we set g = fi instead, the scheme would be 
just plain minimal subtraction or MS.] Now we have 

nMs(& 2 ) = — ToOi(k 2 + 6m 2 ) + \ol [ dxDln(D/fr 2 ) + 0(a 2 ) , (27.4) 

J o 

as compared to our old result in the on-shell scheme, 

nos(fc 2 ) = — j 2 <y-(k 2 + m 2 ) + \a f dx D \n(D/Do) + 0(a 2 ) , (27.5) 

J 0 

where again D = x(l— x)k 2 + m 2 , and Dq = [—x{l—x)+l}m 2 . Notice that 
n^g(/c 2 ) has a well-defined m —> 0 limit, whereas n os (fc 2 ) does not. On the 
other hand, n^g^A: 2 ) depends explicitly on the fake parameter /j, whereas 
n os (A: 2 ) does not. 

What does this all mean? 

First, in the MS scheme, the propagator Aj^g(/c 2 ) will no longer have a 
pole at k 2 = —m 2 . The pole will be somewhere else. However, by definition , 
the actual physical mass m p h of the particle is determined by the location 
of this pole: k 2 = —m 2 h . Thus, the lagrangian parameter m is no longer 
the same as m p h- 

Furthermore, the residue of this pole is no longer one. Let us call the 
residue R. The LSZ formula must now be corrected by multiplying its 
right-hand side by a factor of R ~ 1 / 2 for each external particle (incoming 
or outgoing). This is because it is the field R~ l ^ 2 ip{x) that now has unit 
amplitude to create a one-particle state. 

Note also that, in the LSZ formula, each Klein-Gordon wave operator 
should be — <9 2 +?n 2 h , and not — <9 2 +m 2 ; also, each external four-momentum 
should square to — m 2 h , and not — m 2 . A review of the derivation of the 
LSZ formula clearly shows that each of these mass parameters must be the 
actual particle mass, and not the parameter in the lagrangian. 

Finally, in the LSZ formula, each external line will contribute a factor of 
R when the associated Klein-Gordon wave operator hits the external prop- 
agator and cancels its momentum-space pole, leaving behind the residue R. 
Combined with the correction factor of i? -1 / 2 for each field, we get a net 
factor of R 1 / 2 for each external line when using the MS scheme. Internal 
lines each contribute a factor of (— i)/(k 2 + to 2 ), where m is the lagrangian- 
parameter mass, and each vertex contributes a factor of iZ g g, where g is 
the lagrangian-parameter coupling. 

Let us now compute the relation between m and ?n p h, and then compute 
R. We have 

A* (k 2 )- 1 = k 2 + m 2 - n Ms(fe 2 ) , (27.6) 
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and, by definition, 

AMs(-^ph) -1 = 0 • (27.7) 

Setting k 2 = — m 2 h in eq. (27.6), using eq. (27.7), and rearranging, we find 

m ph = ~ iW-^ph) • (27.8) 

Since n^g(fc 2 ) is 0(a), we see that the difference between m 2 h and m 2 is 
0(a). Therefore, on the right-hand side, we can replace m 2 h with m 2 , and 
only make an error of 0(a 2 ). Thus 

m 2 h = m 2 - IIms(-to 2 ) + 0(a 2 ) . (27.9) 

Working this out, we get 

m 2 h = m 2 — ^a |m 2 — m 2 + f dx D$ ln(Z?o/^ 2 ) +0(a 2 ), (27.10) 

J 0 

where Do = [1— x(l— x)]m 2 . Doing the integrals yields 

m ph = m2 1 + ^a^ln(/r 2 /m 2 ) + + 0(a 2 ) . (27.11) 

where d = (34 — 37t\/3)/15 = 1.18. 

Now, physics should be independent of the fake parameter fi. However, 
the right-hand side of eq. (27.11) depends explicitly on /./. It must, be, then, 
that m and a take on different numerical values as n is varied, in just the 
right way to leave physical quantities (like m p h) unchanged. 

We can use this information to find differential equations that tell us 
how m and a change with //. For example, take the logarithm of eq. (27.11) 
and divide by two to get 

lnm p h = lnm + ^ a(ln(n/m ) + + 0(a 2 ) . (27.12) 

Now differentiate with respect to In fi and require m p h to remain fixed: 


0 = 


d 

d In /j, 


In m p h 


1 din 
m d In // 


+ + 0(a 2 ) . 


(27.13) 


To get the second line, we had to assume that da/d In// = 0(a 2 ), which 
we will verify shortly. Then, rearranging eq. (27.13) gives 


dm 
d In /i 


( — T 2 a + d)(a 2 )^j m . 


(27.14) 
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The factor in large parentheses on the right is called the anomalous dimen- 
sion of the mass parameter, and it is often given the name 7 m (a). 

Turning now to the residue R, we have 

R- 1 = -jL . (27.15) 

dk z l J „ , 

k 2 =—m, 

ph 

Using eq. (27.6), we get 

R~ l = nUg(— rn.p h ) 

= l-l4s(-m 2 ) + 0(a 2 ) 

= 1 + j^a{\n(ii 2 / m 2 ) + c"^ + 0(a 2 ) , (27.16) 

where c" = (17 — 37r\/3)/3 = 0.23. 

We can also use MS to define the vertex function. We take 

C = -a§ +0(a 2 ) , (27.17) 

and so 

V 3 ,Ms(fei, k 2 , k 3 ) = g l-\aj dF 3 In (D/y 2 ) + 0(a 2 ) (27.18) 

where D = xyk 2 + yzk 2 + zxk 2 + m 2 . 

Let us now compute the (ptp —► (ptp scattering amplitude in our fancy 
new renormalization scheme. In the low-mass limit, repeating the steps 
that led to eq. (26.1), and including the LSZ correction factor (i? 1 / 2 ) 4 , we 
get 

T = R 2 T 0 [l - %a(\n(s/y 2 ) + 0(m°)) + 0(a 2 )] , (27.19) 

where Tq = — g 2 (s~ 1 -) -t^ 1 +U -1 ) is the tree-level result. Now using R from 
eq. (27.16), we find 

T = Tq 1 — a(^j^ln(s/g 2 ) + ^ln(g 2 /m 2 ) + O(m 0 )j + 0(a 2 ) . (27.20) 

To get an observable amplitude-squared with an imperfect detector, we 
must square eq. (27.20) and multiply it by the correction factor we derived 
in section 26, 

|T| 2 bs = |T| 2 [l + \a{\n{5 2 s/m 2 ) + 0(m°)) + 0{a 2 )] , (27.21) 

where 5 is the angular resolution of the detector. Combining this with 
eq. (27.20), we get 

|T| 2 bs = \%? 1 - a(§ ln(s//z 2 ) + | ln(l/6 2 ) + O(m 0 fj + 0(a 2 ) . 

(27.22) 
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All factors of \nm 2 have disappeared! Finally, we have obtained an expres- 
sion that has a well-defined m — > 0 limit. 

Of course, p is still a fake parameter, and so |T| 2 bs cannot depend on 
it. It must be, then, that the explicit dependence on p in eq. (27.22) is 
canceled by the implicit p dependence of a. We can use this information 
to figure out how a must vary with p. Noting that \Tq\ 2 = 0{g 4 ) = 0(a 2 ), 
we have 


In |T| 2 bs = C'i + 21na + 3a(ln/r + C' 2 ) + 0(a 2 ) , (27.23) 

where C\ and C 2 are independent of p and a (but depend on the Mandel- 
stam variables). Differentiating with respect to In p then gives 


0 


d 

din p 


H^lcbs 


2 da 
a d Inp 


+ 3 a + 0(a 2 ) , 


or, after rearranging, 


da 
din p 


— | a 2 + 0(a 3 ) . 


(27.24) 


(27.25) 


The right-hand side of this equation is called the beta function. 

Returning to eq. (27.22), we are free to choose any convenient value of p 
that we might like. To avoid introducing unnecessary large logs, we should 
choose p 2 r\j s . 

To compare the results at different values of s, we need to solve eq. (27.25). 
Keeping only the leading term in the beta function, the solution is 


a(n 2 ) 


a{hi) 

1 + §«(// i)ln(p 2 /pi) 


(27.26) 


Thus, as p increases, a(fr) decreases. A theory with this property is said 
to be asymptotically free. In this case, the tree-level approximation (in the 
MS scheme with p? ~ s) becomes better and better at higher and higher 
energies. 

Of course, the opposite is true as well: as /j decreases, a(p) increases. 
As we go to lower and lower energies, the theory becomes more and more 
strongly coupled. 

If the particle mass is nonzero, this process stops at p ~ m. This 
is because the minimum value of s is 4?n 2 , and so the factor of li^s/ju 2 ) 
becomes an unwanted large log for p <C m. We should therefore not use 
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values of // below m. Perturbation theory is still good at these low energies 
if ot(m) <C 1. 

If the particle mass is zero, ct(n) continues to increase at lower and lower 
energies, and eventually perturbation theory breaks down. This is a signal 
that the low-energy physics may be quite different from what we expect on 
the basis of a perturbative analysis. 

In the case of </> 3 theory, we know what the correct low-energy physics 
is: the perturbative ground state is unstable against tunneling through the 
potential barrier, and there is no true ground state. Asymptotic freedom 
is, in this case, a signal of this impending disaster. 

Much more interesting is asymptotic freedom in a theory that does have 
a true ground state, such as quantum chromodynamics. In this example, 
the particle excitations are colorless hadrons, rather than the quarks and 
gluons we would expect from examining the lagrangian. 

If the sign of the beta function is positive, then the theory is infrared 
free. The coupling increases as fi increases, and, at sufficiently high en- 
ergy, perturbation theory breaks down. On the other hand, the coupling 
decreases as we go to lower energies. Once again, though, we should stop 
this process at ~ m if the particles have nonzero mass. Quantum elec- 
trodynamics with massive electrons (but, of course, massless photons) is in 
this category. 

Still more complicated behaviors are possible if the beta function has 
a zero at a nonzero value of a. We briefly consider this case in the next 
section. 


Reference Notes 

Minimal subtraction is treated in more detail in Brown , Collins, and Ra- 
mond I. 


Problems 

27.1) Suppose that we have a theory with 


/3(a) = b\a 2 + 0(a 3 ) , 
7 m(«) = ci a + 0(a 2 ) . 

Neglecting the higher-order terms, show that 


'a(p 2 ) 
_ot(m) 


1 ci/61 


m(n 1 ) . 


(27.27) 

(27.28) 


m(iJ 2 ) 


(27.29) 



28: The Renormalization Group 


178 


28 The Renormalization Group 

Prerequisite: 27 


In section 27 we introduced the MS renormalization scheme, and used the 
fact that physical observables must be independent of the fake parameter 
p to figure out how the lagrangian parameters m and g must change with 
p. In this section we re-derive these results from a much more formal (but 
calculationally simpler) point of view, and see how they extend to all or- 
ders of perturbation theory. Equations that tells us how the lagrangian 
parameters (and other objects that are not directly measurable, like cor- 
relation functions) vary with p are collectively called the equations of the 
renormalization group. 

Let us recall the lagrangian of our theory, and write it in two different 
ways. In d = 6 — e dimensions, we have 

C = - \Z m m 2 (p 2 + \Z g gp £/2 g > 3 + Yip (28.1) 

and 

C = -±< 9 ^ 03^0 - \rnlipl + \goTo + Vo^o • ( 28 . 2 ) 

The fields and parameters in eq. (28.1) are the renormalized fields and pa- 
rameters. (And in particular, they are renormalized using the MS scheme, 
with p 2 = 47T e _7 /i 2 .) The fields and parameters in eq. (28.2) are the bare 
fields and parameters. Comparing eqs. (28.1) and (28.2) gives us the rela- 
tionships between them: 


To 0) = zy 2 ip(x) , 

(28.3) 

m 0 = Z~ 1/2 Z^ 2 m , 

(28.4) 

go = Z~ 3 / 2 Z g gp £ / 2 , 

(28.5) 

Y 0 = Z- l ' 2 Y . 

(28.6) 


Recall that, after using dimensional regularization, the infinities coming 
from loop integrals take the form of inverse powers of e = 6 — d. In the 
MS renormalization scheme, we choose the Z's to cancel off these powers 
of 1/e, and nothing more. Therefore the Z's can be written as 



(28.7) 

(28.8) 


(28.9) 
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where a = g 2 /( 47t) 3 . Computing II(/c 2 ) and V-s(k \ . k?, hfi in perturbation 
theory in the MS scheme gives us Taylor series in a for a n (a), b n (a), and 
c n {a). So far we have found 


a\{a) 

— — + O(a^) , 

(28.10) 

bi{a) 

— — OL + O(cK^) , 

(28.11) 

ci (a) 

— — a + 0(a 2 ) , 

(28.12) 


and that a n (a), b n (a), and c n (a) are all at least 0(a 2 ) for n> 2. 

Next we turn to the trick that we will employ to compute the beta 
function for a, the anomalous dimension of m, and other useful things. 
This is the trick: bare fields and parameters must be independent of p. 

Why is this so? Recall that we introduced p when we found that we had 
to regularize the theory to avoid infinities in the loop integrals of Feynman 
diagrams. We argued at the time (and ever since) that physical quantities 
had to be independent of p. Thus p is not really a parameter of the theory, 
but just a crutch that we had to introduce at an intermediate stage of the 
calculation. In principle, the theory is completely specified by the values 
of the bare parameters, and, if we were smart enough, we would be able 
to compute the exact scattering amplitudes in terms of them, without ever 
introducing p. The point is this: since the exact scattering amplitudes are 
independent of p. the bare parameters must be as well. 

Let us start with g$. It is convenient to define 

«o = 5o/( 47r ) 3 = z g z fi 3 fi £ at , (28.13) 

and also 

G(a,e) = ln(Z 2 Z“ 3 ) . (28.14) 

From the general structure of eqs. (28.7) and (28.9), we have 

G(a,e) = Y.2^d, (28.15) 

n= 1 

where, in particular, 

G\{a) = 2ci(a) — 3ai(cc) 

= -§ a + 0(a 2 ). (28.16) 

The logarithm of eq. (28.13) can now be written as 

In ckq = G(a, e) + In a + e In p . (28.17) 
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Terms that are higher-order in 1/e must also cancel, and this determines 
all the other G' n (a)' s in terms of G\ (a). Thus, for example, cancellation 
of the O (e^ 1 ) terms fixes G' 2 (a) = aG^a) 2 . These relations among the 
G' n {a)'s can of course be checked order by order in perturbation theory. 
From eq. (28.21) and eq. (28.16), we find that the beta function is 

/3(a) = -fa 2 + 0(a 3 ) . (28.22) 

Hearteningly, this is the same result we found in section 27 by requiring the 
observed scattering cross section |T|° bs to be independent of fi. However, 
simply as a matter of practical calculation, it is much easier to compute 
G\(a ) than it is to compute |T|^ bs . 

Next consider the invariance of mo. We begin by defining 

M(a,e) = \n(Zll 2 Zy 1 / 2 ) 

y* M n (a) 


(28.23) 
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From eqs. (28.10) and (28.12) we have 


Mi (a) 

= ^&i(a) — ^ai(a) 



= — ^a + 0(a 2 ) . 

(28.24) 

Then, from eq. (28.4), we have 



In mo 

= Af(a,e) + lnm . 

(28.25) 


Take the derivative with respect to In /j and require mo to be unchanged: 

d 


0 = 


din /j 

dM(a , £ 
da 


In mo 


da 


+ 


1 dm 


d In p m d In p 

dM(a,e ) / .. ,\ 1 

aa \ > m 


dm 
m dhifi 


(28.26) 


Rearranging, we find 


1 dm 


m d In p 


OO 

= (ea - /3(a)) ^ 


ML, (a) 


n= 1 


= aM((a) + 


(28.27) 


where the ellipses stand for terms with powers of 1/e. In a renormalizable 
theory, dm/d In p should be finite in the e — > 0 limit, and so these terms 
must actually all be zero. Therefore, the anomalous dimension of the mass, 
defined via 

1 dm 

7m(a) = 


m d In p 


(28.28) 


is given by 


7m (a) = aM[(a) 


= — j^a + 0(a 2 ) 


(28.29) 


Comfortingly, this is just what we found in section 27. 

Let us now consider the propagator in the MS renormalization scheme, 


A (fc 2 ) = i J d 6 xe ikx { 0|T<^(xM0)|0> . (28.30) 

The bare propagator, 

A 0 (k 2 ) = i J d 6 xe ikx (0\TMx)M0)\0) , (28.31) 
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should be (by the now-familiar argument) independent of p. The bare and 
renormalized propagators are related by 

A 0 (A: 2 ) = Z^K{k 2 ) . (28.32) 

Taking the logarithm and differentiating with respect to In //, we get 

d 


0 = 


In A 0 (£r 


din p 
d In Zu- 


d In p d In p 


+ 


d 


■ In A {k 2 


d In Z,, 


+ 


1 ( 9 


+ _do L a_ + *?_8 ]Mk2y (28i33) 


d In p A (k 2 ) V <9 hi p dln/r 3a ri In p dm , 

We can write 


In = 


ai(a) 02 (a) — ^ai(a) 

T o T • • • 


(28.34) 


Then we have 


d In Z^ d In Z v da 


din p 


da d In p 
a'^ia) 


+ 


. . ea + /3(afj 


— — aa 1 (a) -)-... , 


(28.35) 


where the ellipses in the last line stand for terms with powers of 1/e. Since 
A (A: 2 ) should vary smoothly with p in the e — > 0 limit, these must all be 
zero. We then define the anomalous dimension of the field 


1 cilnZp 

> (q,s 2^7' 

From eq. (28.35) we find 

7 ^ (a) = -\aafia) 

= -t-yr^a + 0(a 2 ) . 

Eq. (28.33) can now be written as 

(alb + + 7m( “ )m b + 2 ^ (a) ) A(lc2) = 0 


(28.36) 


(28.37) 


(28.38) 
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in the e — > 0 limit. This is the Callan-Symanzik equation for the propagator. 

The Callan-Symanzik equation is most interesting in the massless limit, 
and for a theory with a zero of the beta function at a nonzero value of a. 
So, let us suppose that /3(a*) = 0 for some a* / 0. Then, for a = a* and 
m = 0, the Callan-Symanzik equation becomes 

(d^ + 27<p(a * ) ) A(k2)=0 ' (28 ' 39) 


The solution is 


A (k 2 ) 


C(a *) 
k 2 



-7 


(28.40) 


where C , (a*) is an integration constant. (We used the fact that A (k 2 ) 
has mass dimension —2 to get the k 2 dependence in addition to the p 
dependence.) Thus the naive scaling law A(k 2 ) ~ k~ 2 is changed to 
A (k 2 )~k-^~^l This has applications in the theory of critical phe- 
nomena, which is beyond the scope of this book. 


Reference Notes 


The formal development of the renormalization group is explored in more 
detail in Brown , Collins , and Ramond I. 

Problems 


28.1) Consider </ 2 4 theory, 

£ = -^Z^cpd^ip - \Z m m 2 v 2 - , (28.41) 

in d = 4 — £ dimensions. Compute the beta function to 0(A 2 ), the 
anomalous dimension of m to O(A), and the anomalous dimension of 
:p to 0(A). 

28.2) Repeat problem 28.1 for the theory of problem 9.3. 

28.3) Consider the lagrangian density 

£ = -\Z v d^ L ipd^ip - \Z m m 2 ip 2 + Yp 
- \Z x d^xdnX ~ \Z m M 2 x 2 
+ \Z g gp e l 2 ^ + \Z h hg £ ' 2 vx 2 (28.42) 

in d = 6 — e dimensions, where cp and x are rea l scalar fields, and Y 
is adjusted to make (0|<^(x)|0) = 0. (Why is no such term needed for 
X?) 
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a) Compute the one-loop contributions to each of the Z ’ s in the MS 
renormalization scheme. 

b) The bare couplings are related to the renormalized ones via 

go = Z~ 3 / 2 Z g gg £ / 2 , (28.43) 

ho = Z^Z-^Zhhjf' 2 . (28.44) 

Define 

G(g, h, e) = En=i Gn(g , h)e~ n = In (Z~ 3 / 2 Zg ) , (28.45) 

H(g, h, e) = i H n (g, h)e~ n = MZ~ x Z~ x l 2 Z h ) . (28.46) 


By requiring go and ho to be independent of g. and by assuming that 
dg/dg and dh/dg are finite as e — > 0, show that 


dg 

»Tg = 
dh 

»Tg = 


-\ £ g + \g(g 

, . / 0H X dHi\ 

-* £h+ * h (g-dr +h -dh) 


dG i , dG\ 

-W + h lm 

dH x 


(28.47) 

(28.48) 


c) Use your results from part (a) to compute the beta functions 

/ 3 g (g , h) = lim e ^o l J dg/dg and /3h(g, h) = lim£_>o l J dh/dg. Yon should 
find terms of order g 3 , gh 2 , and h 3 in /3 g , and terms of order g 2 h, gh 2 , 
and h 3 in /?/,.. 

d) Without loss of generality, we can choose g to be positive; h can 
then be positive or negative, and the difference is physically signifi- 
cant. (You should understand why this is true.) For what numerical 
range(s) of h/g are (3 g and fdh/h both negative? Why is this an inter- 
esting question? 
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29 Effective Field Theory 

Prerequisite: 28 

So far we have been discussing only renormalizable theories. In this section, 
we investigate what meaning can be assigned to nonrenormalizable theories, 
following an approach pioneered by Ken Wilson. 

We will begin by analyzing a renormalizable theory from a new point 
of view. Consider, as an example, <p 4 theory in four spacetime dimensions: 

C = —^Z^d^pd^p - ±Z m m ph <£ 2 ~ ^a Z \ A ph </ • (29.1) 

(This example is actually problematic, because this theory is trivial , a tech- 
nical term that we will exlain later. For now we proceed with a perturbative 
analysis.) We take the renormalizing Z factors to be defined in an on-shell 
scheme, and have emphasized this by writing the particle mass as m p h and 
the coupling constant as A p h- We define A p h as the value of the exact 1PI 
four-point vertex with zero external four-momenta: 


A p h = V 4 (0, 0, 0, 0) . 

The path integral is given by 

Z(J) = J TV e iS+i f Jlf , 

where S = f d 4 x £ and f J ip is short for / d 4 x Jp. 

Our first step in analyzing this theory will be to perform the Wick 
rotation (applied to loop integrals in section 14) directly on the action. We 
define a euclidean time r = it. Then we have 

Z(J) = J Vp e - SE ~f J<p , (29.4) 

where Se = f (1 4 x£e, d 4 x = d 3 xdr , 

£e = IZ^pd^p + lZ. m m^ h p 2 + 7^Z X A ph </ , (29.5) 

and 

dfj.pdfj.p = ( dp/dr ) 2 + (Vp) 2 . (29.6) 

Note that each term in Se is always positive (or zero) for any field configu- 
ration p(x). This is the advantage of working in euclidean space: eq. (29.4), 
the euclidean path integral , is strongly damped (rather than rapidly oscil- 
lating) at large values of the field and/or its derivatives, and this makes its 
convergence properties more obvious. 


(29.2) 

(29.3) 
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Next, we Fourier transform to (euclidean) momentum space via 

<29 ' 7) 

The euclidean action becomes 

1 f d 4 k / \ 

S E = ^ J JT^V(~ k )( Z V k2 + Z m m phj^O) 

+ ^Z x X ph j ^ ^ (27T) 4 5\k 1 +k 2 +k 3 +h) 

xip(ki)ip(k2)ip{k 3 )ip(k4) . (29.8) 

Note that k 2 = k 2 + k 2 > 0. 

We now introduce an ultraviolet cutoff A. It should be much larger than 
the particle mass m p h, or any other energy scale of practical interest. Then 
we perform the path integral over all fifk) with \k\ > A. We also take 


J(k) = 0 for \k\ > A. Then we find 


Z(J) = J V m<A e S«('rA)-fJ'P , 

(29.9) 

where 


e -S efl ( V ;A) = j Vlp{kl>A 6 -Se(v) . 

(29.10) 


£eff(<^;A) is called the Wilsonian effective action. We can write the corre- 
sponding lagrangian density as 


£ e ff(<^;A.) = \Z{A)dfj,^d^ + \m 2 (A)y 2 + ^A(A)y> 4 

+ EE c d}i (A)O dti , (29.11) 

d>6 i 

where the Fourier components of ip{x ) are now cut off at \k\ > A: 

r A 

<pw=j 0 j^r eikx ^- ( 29 - 12 ) 

The operators O d j in eq. (29.11) consist of all terms that have mass dimen- 
sion d > 6 and that are even under ip — (p; i is an index that distinguishes 
operators of the same dimension that are inequivalent after integrations by 
parts of any derivatives that act on the fields. (The operators must be even 
under <p <-» —<p in order to respect the tp — ip symmetry of the original 
lagrangian.) 

The coefficients Z( A), m 2 ( A), A(A), and c dl (JA) in eq. (29.11) are all 
finite functions of A. This is established by the following argument. We can 
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Figure 29.1: A one-loop 1PI diagram with 2 n external lines. Each external 
line represents a field with \k\ < A. The internal (dashed) line represents a 
field with \k\ > A. 

differentiate eq. (29.9) with respect to J(x) to compute correlation functions 
of the renormalized field ip(x ), and correlation functions of renormalized 
fields are finite. Using eq. (29.9), we can compute these correlation functions 
as a series of Feynman diagrams, with Feynman rules based on £ e ff- These 
rules include an ultraviolet cutoff A on the loop momenta, since the fields 
with higher momenta have already been integrated out. Thus all of the 
loop integrals in these diagrams are finite. Therefore the other parameters 
that enter the diagrams — Z( A), m 2 (A), A(A), and Cd^{ A) — must be finite 
as well, in order to end up with finite correlation functions. 

To compute these parameters, we can think of eq. (29.8) as the action 
for two kinds of fields, those with \k\ < A and those with \k\ > A. Then 
we draw all 1PI diagrams with external lines for \k\ < A fields only. For 
A p h *C 1, the dominant contribution to Cd,i{ A) for an operator Od,i with 2 n 
fields and d — 2n derivatives is then given by a one-loop diagram with 2 n 
external lines (representing \k\ < A fields), n vertices, and a \k\ > A field 
circulating in the loop; see fig. (29.1). 

The simplest case to consider is O 2 n> i = ip 2n . With 2 n external lines, 
there are (2n)! ways of assigning the external momenta to the lines, but 
2 n x n x 2 of these give the same diagram: 2” for exchanging the two external 
lines that meet at any one vertex; n for rotations of the diagram; and 2 
for reflection of the diagram. Since there are no derivatives on the external 
fields, we can set all of the external momenta to zero; then all (2n)!/(2"2n) 
diagrams have the same value. With a euclidean action, each internal line 
contributes a factor of 1 /(k 2 + m 2 h ), and each vertex contributes a factor 
of —Z\ A p h = — A p h + 0(A 2 h ). The vertex factor associated with the term 
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C 2 n,i(A)</ 2 2n in C c g is — (2n)! C 2 n ,i(A). Thus we have 
-(2n,!c2n,i(A) = r *L( 


2 n 2 n Ja (27t) 4 y k 2 + rn ph J 
+ 0(A”+ 1 ). (29.13) 

For 2 n > 6, the integral converges, and we find 

c - 4A > = -&Wa^ + 0( « 1) - <2al4> 


We have taken A 3> m p h, and dropped terms down by powers of m p h/A. 

For 2 n = 4, we have to include the tree-level vertex; in this case, we 
have 



+ 0(\ 3 ph ). (29.15) 


This integral diverges. To evaluate it, we note that the one- loop contribu- 
tion to the exact four-point vertex is given by the same diagram, but with 
fields of all momenta circulating in the loop. Thus we have 


-V 4 (0, 0,0,0) 



+ 0( x 3 ph ). (29.16) 


Then, using V 4 (0, 0, 0, 0) = A p h and subtracting eq. (29.15) from eq. (29.16), 
we get 


— A p h + A(A) 



+ 0 ( A p h) • 


(29.17) 


Evaluating the (now finite!) integral and rearranging, we have 


A (A) — A p h + ln(A/m p h) 2 + 0(A p h) 


(29.18) 


Note that this result has the problem of a large log ; the second term is 
smaller than the first only if A p h ln(A/m p h) 1. To cure this problem, we 
must change the renormalization scheme. We will take up this issue shortly, 
but first let us examine the case of two external lines while continuing to 
use the on-shell scheme. 
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For the case of two external lines, the one-loop diagram has just one 
vertex, and by momentum conservation, the loop integral is completely 
indepedent of the external momentum. This implies that the one-loop 
contribution to Z( A) vanishes, and so we have 

Z(A) = l + 0(A 2 h ). (29.19) 

The one- loop diagram does, however, give a nonzero contribution to m 2 ( A); 
after including the tree-level term, we find 

1 ro o 1 

-m 2 ( A) = -Z m m ph + -(— A p h) J ^ ^tt ) 4 k 2 + m 2 Y + ' ( 29 - 20 ) 

This integral diverges. To evaluate it, recall that the one-loop contribution 
to the exact particle mass-squared is given by the same diagram, but with 
fields of all momenta circulating in the loop. Thus we have 

^ roc d 4 fc 1 

~ m ph = ~ z mm ph + - (-Aph) J o (2tt) 4 k 2 + m 2 h + ' ( 29 - 21 ) 

Then, subtracting eq. (29.20) from eq. (29.21), we get 

1 r A ii 4 k 1 

-mj h + m 2 (A) = -(-X ph )l y + m 2 h + O(^ph) • (29.22) 

Evaluating the (now finite!) integral and rearranging, we have 

m 2 ( A) = m 2 h - [A 2 - m 2 h ln(A 2 /m 2 h )] + 0(A 2 h ) . (29.23) 

We see that we now have an even worse situation than we did with the 
large log in A(A): the correction term is quadratically divergent. 

As already noted, to fix these problems we must change the renormal- 
ization scheme. In the context of an effective action with a specific value of 
the cutoff Ao, there is a simple way to do so: we simply treat this effective 
action as the fundamental starting point, with Z{ Ao), m 2 (Ao), A(Ao), and 
Q,i(Ao) as input parameters. We then see what physics emerges at energy 
scales well below Ao- We can set Z( Ao) = 1, with the understanding that 
the field no longer has the LSZ normalization (and that we will have to 
correct the LSZ formula to account for this). We will also assume that 
the parameters A(Ao), m 2 (Ao), and c c ]^{A {) ) are all small when measured in 
units of the cutoff: 


A(A 0 ) <C 1 , 

(29.24) 

m 2 (A 0 )| < Ag , 

(29.25) 

c d ,i( A 0 ) < A ” 1 [d ~ 4) . 

(29.26) 
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The proposal to treat the effective action as the fundamental starting 
point may not seem very appealing. For one thing, we now have an infinite 
number of parameters to specify, rather than two! Also, we now have an 
explicit cutoff in place, rather than trying to have a theory that works at 
all energy scales. 

On the other hand, it may well be that quantum field theory does not 
work at arbitrarily high energies. For example, quantum fluctuations in 
spacetime itself should become important above the Planck scale, which is 
given by the inverse square root of Newton’s constant, and has a numerical 
value of ~ 10 19 GeV (compared to, say, the proton mass, which is ~ 1 GeV). 

So, let us leave the cutoff Ao in place for now. We will then make a two- 
pronged analysis. First, we will see what happens at much lower energies. 
Then, we will see what happens if we try to take the limit Ao — » oo. 

We begin by examining lower energies. To make things more tractable, 
we will set 

Cd,i(Ao) = 0 ; (29.27) 

later we will examine the effects of a more general choice. 

A nice way to see what happens at lower energies is to integrate out some 
more high-energy degrees of freedom. Let us, then, perform the functional 
integral over Fourier modes ip(k) with A < \k\ < Ao; we have 

e -S eff ( V ;A) = J VipA<kl<Ao g-SeftOMo) . (29.28) 

We can do this calculation in perturbation theory, mimicking the procedure 
that we used earlier. We find 

1 /-A o nff 1 

” 2< A) = m2(A »> + 2 A,A ">X ( 2 ^ Prfiw + " ' • (29 ' 29) 

A(A) = A(A 0 ) - -A 2 (A 0 )j^ + +••• ■ ( 29 - 3 °) 

(—l) n Mo d A k / 1 \ n 

«-‘< A > = -Wm< a 4 (wlratw) + • (29 - 31) 

where the ellipses stand for higher-order corrections. For A not too much 
less than A 0 (and, in particular, for |m 2 (A 0 )| <C A 2 ), we find 

m 2 ( A) = m 2 (A 0 ) + — ^ . A(A 0 ) (a 2 - A 2 ) + . . . , (29.32) 

A(A) = A(A 0 ) - y|^A 2 (Ao) In ^ + . . . , (29.33) 
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Figure 29.2: A one-loop contribution to the <p 4 vertex for fields with \k\ < A. 
The internal (dashed) line represents a field with \k\ > A. 


We see from this that the corrections to m 2 ( A), which is the only coefficient 
with positive mass dimension, are dominated by contributions from the 
high end of the integral. On the other hand, the corrections to c<i ,;(A), 
coefficients with negative mass dimension, are dominated by contributions 
from the low end of the integral. And the corrections to A(A), which is 
dimensionless, come equally from all portions of the range of integration. 

For the Cd,i( A), this means that their starting values at Ao were not 
very important, as long as eq. (29.26) is obeyed. Nonzero starting values 
would contribute another term of order 1 /Aq” - 4 to C 2 n ,i(A), but all such 
terms are less important than the one of order 1/A 2n_4 that comes from 
doing the integral down to \k\ = A. 

Similarly, nonzero values of Q ,;(Aq) would make subdominant contri- 
butions to A(A). As an example, consider the contribution of the diagram 
in fig. (29.2). Ignoring numerical factors, the vertex factor is C6,i(Ao), and 
the loop integral is the same as the one that enters into m 2 ( A); it yields a 
factor of Aq — A 2 ~ Aq. Thus the contribution of this diagram to A(A) is 
of order C6 ,i(Ao)Aq. This is a pure number that, according to eq. (29.26), 
is small. This contribution is missing the logarithmic enhancement factor 
ln(Ao/A) that we see in eq. (29.33). 

On the other hand, for m 2 ( A), there are infinitely many contributions of 
order Aq when Ao) T 4 0. These must add up to give m 2 ( A) a value that 
is much smaller. Indeed, we want to continue the process down to lower 
and lower values of A, with m 2 ( A) dropping until it becomes of order m 2 h 
at A ~ r/Tph . For this to happen, there must be very precise cancellations 
among all the terms of order Aq that contribute to m 2 ( A). In some sense, it 
is more “natural” to have m 2 h ~ A(Ao)Aq, rather than to arrange for these 
very precise cancellations. This philosophical issue is called the fine-tuning 
problem, and it generically arises in theories with spin-zero fields. 

In theories with higher-spin fields only, the action typically has more 
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symmetry when these fields are massless, and this typically prevents diver- 
gences that are worse than logarithmic. These theories are said to be tech- 
nically natural, while theories with spin-zero fields (with physical masses 
well below the cutoff) generally are not. (The only exceptions are theories 
where supersymmetry relates spin-zero and spin-one-half fields; the spin- 
zero fields then inherent the technical naturalness of their spin-one-half 
partners.) For now, in ip 4 theory, we will simply accept the necessity of 
fine-tuning in order to have m p h <C A. 

Returning to eqs. (29.32-29.34), we can recast them as differential equa- 
tions that tell us how these parameters change with the vaule of the cutoff 
A. In particular, let us do this for A(A). We take the derivative of eq. (29.33) 
with respect to A, multiply by A, and then set Ao = A to get 

^A( a ) = JLa^) + ".. (29 

Notice that the right-hand side of eq. (29.33) is apparently the same as the 
beta function f3(\) = dX/dln/j, that we calculated in problem 28.1, where 
it represented the rate of change in the MS parameter A that was need to 
compensate for a change in the MS renormalization scale p. Eq. (29.33) 
gives us a new physical interpretation of the beta function: it is the rate of 
change in the coefficient of the p A term in the effective action as we vary 
the ultraviolet cutoff in that action. 

Actually, though, there is a technical detail: it is really Z(A)~ 2 A(A) 
that is most closely analogous to the MS parameter A. This is because, if 
we rescale tp so that it has a canonical kinetic term of then the 

coefficient of the p A term is Z( A) _2 A(A). Since Z{A) = 1 + 0(A 2 (A)), this 
has no effect at the one-loop level, but it does matter at two loops. We 
can account for the effect of this wave function renormalization (in all the 
couplings) by writing, instead of eq. (29.11), 

C eS (p-,A) = ^Z(A)d p pd fl p + \Z(A)m 2 (A)p 2 + ^Z 2 (A)A(A)<^ 4 

+ E E Z nd ’^ 2 (A) c dt i (A) O d: i , (29.36) 

d>6 i 



where n ^ j is the number of fields in the operator O^i- Now the beta 
function for A is universal up through two loops; see problem 29.1. At 
three and higher loops, differences with the MS beta function can arise, 
due to the different underlying definitions of the coupling A in the cutoff 
scheme and the MS scheme. 

We now have the overall picture of Wilson’s approach to quantum field 
theory. First, define a quantum field theory via an action with an explicit 
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momentum cutoff in place. 1 Then, lower the cutoff by integrating out 
higher-momentum degrees of freedom. As a result, the coefficients in the 
effective action will change. If the field theory is weakly coupled — which in 
practice means eqs. (29.24-29.26) are obeyed — then the coefficients of the 
operators with negative mass dimension will start to take on the values 
we would have computed for them in perturbation theory, regardless of 
their precise initial values. If we continuously rescale the fields to have 
canonical kinetic terms, then the dimensionless coupling constant (s) will 
change according to their beta functions. The final results, at an energy 
scale E well below the initial cutoff Ao, are the same as we would predict 
via renormalized perturbation theory, up to small corrections by powers of 
E/ Ao- 

The advantage of the Wilson scheme is that it gives a nonperturbative 
definition of the theory which is applicable even if the theory is not weakly 
coupled. With a spacetime lattice providing the cutoff, other techniques 
(typically requiring large-scale computer calculations) can be brought to 
bear on strongly-coupled theories. 

The Wilson scheme also allows us to give physical meaning to nonrenor- 
malizable theories. Given an action for a nonrenormalizable theory, we can 
regard it as an effective action. We should then impose a momentum cutoff 
Ao, where Ao can be defined by saying that the coefficient of every operator 
Oj with mass dimension D* > 4 is given by Cj/A^* -4 with c* < 1. Then 
we can use this theory for physics at energies below Ao- At energies E far 
below Ao, the effective theory will look like a renormalizable one, up to 
corrections by powers of E/Kq. (This renormalizable theory might simply 
be a free-held theory with no interactions, or no theory at all if there are 
no particles with physical masses well below Ao-) 

We now turn to the final issue: can we remove the cutoff completely? 

Returning to the example of if 4 theory, let us suppose that we are 
somehow able to compute the exact beta function. Then we can integrate 
the renormalization-group equation dX / d In A = /3(A) from A = m p h to 
A = Aq to get 


W A o) d \ 


= In 


An 


/a ( m ph ) /3(A) m ph 


(29.37) 


We would like to take the limit Ao — > oo. Obviously, the right-hand side of 
eq. (29.37) becomes infinite in this limit, and so the left-hand side must as 
well. 

However, it may not. Recall that, for small A, /3(A) is positive, and it 


x This can be done in various ways: for example, we could replace continuous spacetime 
with a discrete lattice of points with lattice spacing a; then there is an effective largest 
momentum of order 1/a. 
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increases faster than A. If this is true for all A, then the left-hand side of 
eq. (29.37) will approach a fixed, finite value as we take the upper limit of 
integration to infinity. This yields a maximum possible value for the initial 
cutoff, given by 


A r°° d\ 

iv-max / Uj/X 

m ph A(rn ph ) /3(A) ■ 


(29.38) 


If we approximate the exact beta function with its leading term, 3A 2 /167r 2 , 
and use the leading term in eq. (29.18) to get A(m p h) = A p h, then we find 


A ~ m ,= 167r2 / 3A ph 

1 '■max — ' 1 ‘ph e 


(29.39) 


The existence of a maximum possible value for the cutoff means that we 
cannot take the limit as the cutoff goes to infinity; we must use an effective 
action with a cutoff as our starting point. If we insist on taking the cutoff 
to infinity, then the only possible value of A p h is A p h = 0. Thus, gfi theory 
is trivial in the limit of infinite cutoff: there are no interactions. (There is 
much evidence for this, but as yet no rigorous proof. The same is true of 
quantum electrodynamics, as was first conjectured by Landau; in this case, 
Amax is known as the location of the Landau pole.) 

However, the cutoff can be removed if the beta function grows no faster 
than A at large A; then the left-hand side of eq. (29.37) would diverge as we 
take the upper limit of integration to infinity. Or, /3(A) could drop to zero 
(and then become negative) at some finite value A*. Then, if A p h < A*, the 
left-hand side of eq. (29.37) would diverge as the upper limit of integration 
approaches A*. In this case, the effective coupling at higher and higher 
energies would remain fixed at A* , and A = A* is called an ultravioldet fixed 
point of the renormalization group. 

If the beta function is negative for A = A(m p h), the theory is said to 
be asymptotically free , and A(A) decreases as the cutoff is increased. In 
this case, there is no barrier to taking the limit A — > oo. In four space- 
time dimensions, the only asymptotically free theories are nonabelian gauge 
theories; see section 69. 


Reference Notes 

Effective held theory is discussed in Georgf Peskin & Schroeder , and Wein- 
berg I. An introduction to lattice theory can be found in Smit. 

Problems 

29.1) Consider a theory with a single dimensionless coupling g whose beta 
function takes the form /3(g) = b±g 2 + & 2 S 3 + • • • • Now consider a 
new definition of the coupling of the form g = g + C 2 <? 2 + . . . . 
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a) Show that /3(g) = big 2 + & 25 3 + • • • • 

b) Generalize this result to the case of multiple dimensionless cou- 
plings. 

29.2) Consider p 3 theory in six euclidean spacetime dimensions, with a 
cutoff Ao and lagrangian 

C = \Z{ A 0 )d^d^ + iZ 3 / 2 (AoMAo V s • (29.40) 

We assume that we have fine-tuned to keep m 2 (A) <C A 2 , and so we 
neglect the mass term. 

a) Show that 


Z( A) 

5(A) 


Wl-^ 2 <A 0 )^ 


\ 


r A o dH 


I A (27 r) 6 {k+£) 2 £ 2 


+ 


k 2 = 0 


Z 3 ' 2 { A 0 ; 
Z 3 / 2 (A) 


5 ( Ao )(^1 + g “ 


Ao 


A (27t) 6 (l 2 ) 3 


+ ••• 


Hint: note that the tree-level propagator is A(k) = [Z(Ao)A: 2 ] 1 
b) Use your results to compute the beta function 


«9( a,) ^ A_9 (a) , (29.41) 


and compare with the result in section 27. 
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30 Spontaneous Symmetry Breaking 

Prerequisite: 21 

Consider ip 4 theory, where ip is a real scalar field with lagrangian 

C = — \d y p>d^ip — | m 2 ip 2 — ^A (p 4 . (30.1) 

As we discussed in section 23, this theory has a Z 2 symmetry: C is invari- 
ant under ip(x) — > —ip(x), and we can define a unitary operator Z that 
implements this: 

Z~ 4 ip(x)Z = —:p(x) . (30.2) 

We also have Z 2 = 1, and so Z = Z. Since unitarity implies Z _1 = Z\ 
this makes Z hermitian as well as unitary. 

Now suppose that the parameter m 2 is, in spite of its name, negative 
rather than positive. We can write C in the form 

L = -\& i <pd lt <p-V{ip) , (30.3) 

where the potential is 

V(ip) = \m 2 :p 2 + ±Acp 4 

= - y2 ) 2 ~ ■ ( 30 - 4 ) 

In the second line, we have defined 

v = +(6|m 2 |/A) 1//2 . (30.5) 

We can (and will) drop the last, constant, term in eq. (30.4). 

From eq. (30.4) it is clear that there are two classical field configurations 
that minimize the energy: (p(x) = +v and <p(x) = —v. This is in contrast 
to the usual case of positive m 2 , for which the minimum-energy classical 
field configuration is <p{x) = 0. 

We can expect that the quantum theory will follow suit. For m 2 < 0, 
there will be two ground states, |0+) and 1 0 — ) , with the property that 

(0+|^(x)|0+) = +v , 

(0-|v?(x)|0— ) = -v , (30.6) 

up to quantum corrections from loop diagrams that we will treat in detail 
in section 30. These two ground states are exchanged by the operator Z. 

z\0+) = |0-> , (30.7) 

and they are orthogonal: (0+|0— ) = 0. 
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This last claim requires some comment. Consider a similar problem in 
quantum mechanics, 


H = \p 2 + jgX(x 2 - v 2 ) 2 . (30.8) 

There are two approximate ground states in this case, specified by the 
approximate wave functions 

ip± (x) = (x|0±) ~ exp[— uj(x T v) 2 /2] , (30.9) 

where u> = (Au 2 /3) 1 / 2 is the frequency of small oscillations about the mini- 
mum. However, the true ground state is a symmetric linear combination of 
these. The antisymmetric linear combination has a slightly higher energy, 
due to the effects of quantum tunneling. 

We can regard a field theory as an infinite set of oscillators, one for each 
point in space, each with a hamiltonian like eq. (30.8), and coupled together 
by the (V<p) 2 term in the field-theory hamiltonian. There is a tunneling 
amplitude for each oscillator, but to turn the field-theoretic state |0+) into 
1 0 — ) , all the oscillators have to tunnel, and so the tunneling amplitude gets 
raised to the power of the number of oscillators, that is, to the power of 
infinity (more precisely, to a power that scales like the volume of space). 
Therefore, in the limit of infinite volume, (0+|0— ) vanishes. 

Thus we can pick either |0+) or |0— ) to use as the ground state. Let 
us choose |0+). Then we can define a shifted field, 

p(x) = ip(x) — v , (30.10) 

which obeys (0+|p(x)|0+) = 0. (We must still worry about loop corrections, 
which we will do at the end of this section.) The potential becomes 

V(v) = [{ P + v) 2 -v 2 ] 2 

= \^v 2 p 2 + \\vp i + ^Ap 4 , (30.11) 

and so the lagrangian is now 

C = -\d ,Jj pd tl p - \\v 2 p 2 - gA' vp 3 - tjjA/3 4 . (30.12) 

We see that the coefficient of the p 2 term is gAu 2 = |m 2 |. This coefficient 
should be identified as \m 2 , where m p is the mass of the corresponding p 
particle. Also, we see that the shifted field now has a cubic as well as a 
quartic interaction. 

Eq. (30.12) specifies a perfectly sensible, renormalizable quantum field 
theory, but it no longer has an obvious Z 2 symmetry. We say that the Z 2 
symmetry is spontaneously broken. 
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This leads to a question about renormalization. If we include renormal- 
izing Z factors in the original lagrangian, we get 

£ = - \Z m m 2 p> 2 - ±Z X \p 4 ■ (30.13) 

For positive m 2 , these three Z factors are sufficient to absorb infinities for 
d < 4, where the mass dimension of A is positive or zero. On the other 
hand, looking at the lagrangian for negative m 2 after the shift, eq. (30.12), 
we would seem to need an extra Z factor for the p 3 term. Also, once we 
have a p 3 term, we would expect to need to add a p term to cancel tadpoles. 
So, the question is, are the original three Z factors sufficient to absorb all 
the divergences in the Feynman diagrams derived from eq. (30.13)? 

The answer is yes. To see why, consider the quantum action (introduced 
in section 21) 

r(v) = \J ■?(-*:) (* 2 +™ 2 -n (k 2 ))if(k) 

x V n (fci, . . . , k n ) ip(ki) . . . (f(k n ) , (30.14) 

computed with m 2 > 0. The ingredients of r(y?)- the self-energy II(£: 2 ) 
and the exact 1PI vertices V n — are all made finite and well-defined (in, 
say, the MS renormalization scheme) by adjusting the three Z factors in 
eq. (30.13). Furthermore, for in 2 > 0, the quantum action inherits the Z 2 
symmetry of the classical action. To see this directly, we note that V ra 
must zero for odd n, simply because there is no way to draw a 1PI diagram 
with an odd number of external lines using only a four-point vertex. Thus 
r(<£>) also has the Z 2 symmetry. This is a simple example of a more general 
result that we proved in problem 21.2: the quantum action inherits any 
linear symmetry of the classical action, provided that it is also a symmetry 
of the integration measure "Dip. ( Linear means that the transformed fields 
are linear functions of the original ones.) The integration measure is almost 
always invariant; when it is not, the symmetry is said to be anomalous. We 
will meet an anomalous symmetry in section 75. 

Once we have computed the quantum action for m 2 > 0, we can go 
ahead and consider the case of m 2 < 0. Recall from section 21 that the 
quantum equation of motion in the presence of a source is <5T /S<p(x) = 
— J(x), and that the solution of this equation is also the vacuum expectation 
value of ip{x). Now set J(x) = 0, and look for a translationally invariant 
(that is, constant) solution ip(x) = v. If there is more than one such 
solution, we want the one(s) with the lowest energy. This is equivalent to 
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minimizing the quantum potential U(tp), where 


r 0) 


d A x 


-U{<p)~ \Z{ip)d^pd^ip + . . . 


(30.15) 


and the ellipses stand for terms with more derivatives. In a weakly coupled 
theory, we can expect the loop-corrected potential U {ip) to be qualitatively 
similar to the classical potential V{ip). Therefore, for m 2 < 0, we expect 
that there are two minima of U {ip) with equal energy, located at ip(x) = ±v, 
where v = (0|<£>(x)|0) is the exact vacuum expectation value of the field. 

Thus we have a description of spontaneous symmetry breaking in the 
quantum theory based on the quantum action, and the quantum action 
is made finite by adjusting only the three Z factors that appear in the 
original, symmetric form of the lagrangian. 

In the next section, we will see how this works in explicit calculations. 
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31 Broken Symmetry and Loop Corrections 

Prerequisite: 30 

Consider ip 4 theory, where (p is a real scalar field with lagrangian 

C = -^Z^pd^ip - \Z m m 2 ip 2 - ^gZ\\(p 4 . (31.1) 

In d = 4 spacetime dimensions, the coupling A is dimensionless. 

We begin by considering the case m 2 > 0, where the Z 2 symmetry of C 
under <p — ► —<p is manifest. We wish to compute the three renormalizing 
Z factors. We work in d = 4 — s dimensions, and take A — > \p £ (where p 
has dimensions of mass) so that A remains dimensionless. 

The propagator correction n(A; 2 ) is given by the diagrams of fig. (31.1), 
which yield 

in(A: 2 ) = i(-?:A/C)iA(0) - i(Ak 2 + Bm 2 ) , (31.2) 

where A = Z^ — 1 and B = Z m — 1, and 

<3L3) 

Using the usual bag of tricks from section 14, we find 

/2 e A(0) = \ - + 1 + ln(^ 2 /m 2 ) m 2 , (31-4) 

(47r) z L £ 

where p 2 = Ane^p 2 . Thus 

n(A; 2 ) = ^ , 9 - + 1 + In (p 2 /m 2 ) m 2 — Ak 2 — Bm 2 . (31.5) 

2 ( 47 t )^ [ e \ 

From eq. (31.5) we see that we must have 

A = 0( A 2 ) , (31.6) 

B = 16^(7 + KB ) + °( A2 ) ’ ( 3L7 ) 

where kb is a finite constant (that may depend on p). In the MS renor- 
malization scheme, we take kb = 0, but we will leave k b arbitrary for 
now. 

Next we turn to the vertex correction, given by the diagram of fig. (31.2), 
plus two others with hp k% and k^ ky all momenta are treated as 
incoming. We have 

*V 4 (fci, k 2 , A: 3 , ki) = —iZ\X + ^(-?'A) 2 (i) iF(—s) + iF(—t ) + iF(—u ) 

+ 0(A 3 ). (31.8) 
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k k 


Figure 31.1: 0(A) corrections to II (k 2 ). 


I + k i + h 2 


Figure 31.2: The 0(A 2 ) correction to V^/ci, k, 2 , k$, £q). Two other dia- 
grams, obtained from this one via k -2 k% and ^2 fc 4 , also contribute. 


Here we have defined s = — (k\ + /C 2 ) 2 , t = —{k\ + k^) 2 , u = —{k\ + ki) 2 , 
and 


iF(k 2 ) = fl £ j 


(2n) d (( i+k ) 2 + m 2 )(£ 2 + m 2 ) 


i T 2 I" 
167T 2 _£+ Jo 


+ [ dx In (p 2 /D) 
Jo 


(31.9) 


where D = x(l—x)k 2 + m 2 . Setting Z\ = 1 + C in eq. (31.8), we see that 
we need 

C = ~ yryy + 0(A 2 ) , (31.10) 


16 vr 2 Ve ) v " v ’ 

where kc is a finite constant. 

We may as well pause to compute the beta function, /3(A) = dX/dln / 1 , 
where the derivative is taken with the bare coupling Ao held fixed, and 
the finite parts of the counterterms set to zero, in accord with the MS 
prescription. We have 

Ao = Z\ , (31.11) 


{z x zy) 


iS4 +0(A ' 2) - 


(31.12) 


Let Ti(A) be the coefficient of 1/e in eq. (31.12). Our analysis in section 
28 shows that the beta function is then given by /3(A) = X 2 L' 1 (X). Thus we 
find 

o\2 

/3(A) = ——r, + 0(A 3 ) . (31.13) 
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Figure 31.3: The O(A) correction to the vacuum expectation value of the 
p held. 

The beta function is positive, which means that the theory becomes more 
and more strongly coupled at higher and higher energies. 

Now we consider the more interesting case of m 2 < 0, which results in 
the spontaneous breakdown of the Z 2 symmetry. 

Following the procedure of section 30, we set tp(x) = p{x) + v, where 
v = (6|m 2 |/A) 1 / 2 minimizes the potential (without Z factors). Then the 
lagrangian becomes (with Z factors) 

C = -\Z^pd p p- \{\Z x -\Z m )m 2 p p 2 

+ \{Z m -Z x )(3/\~p £ ) ll2 mlp 

~ \Z x (3\p £ ) 1/2 r n p p 3 - ± Z x \p £ p 4 , (31.14) 

where m 2 = 2|m 2 |. Now we can compute various one- loop corrections. 

We begin with the vacuum expectation value of p. The 0(A) correction 
is given by the diagrams of fig. (31.3). The three-point vertex factor is 
—iZ\g 3 , where g 3 can be read off of eq. (31.14): 

g 3 = (3A p £ ) 1/2 m p . (31.15) 

The one-point vertex factor is iY, where Y can also be read off of eq. (31.14): 

Y = \^Z m -Z x ){3l\p £ ) x ' 2 ml . (31.16) 

Following the discussion of section 9, we then find that 

(0|/9(x)|0) = [iY + \{-iZ x gz) iA(0)) J d 4 y\A(x-y) , (31.17) 

plus higher-order corrections. Using eqs. (31.15) and (31.16), and eq. (31.4) 
with m 2 m 2 , the factor in large parentheses in eq. (31.17) becomes 

-(3/X) l l 2 m? p ^Z m —Z\ + ^ + 1 + ln(^ 2 /m 2 ) + 0(A 2 )^ . (31.18) 

Using Z rn = 1 + B and Z\ = 1 + C, with B and C from eqs. (31.7) and 
(31.10), the factor in large parentheses in eq. (31.18) becomes 

k b - kc + 1 + br( / u 2 /m 2 ) . 


(31.19) 
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All the l/e’s have canceled. The remaining finite vacuum expectation value 
for p(x) can now be removed by choosing 

kb — kq = —1 — In (p 2 /m 2 ) . (31.20) 

This will also cancel all diagrams with one- loop tadpoles. 

Next we consider the p propagator. The diagrams contributing to the 
0(A) correction are shown in fig. (31.4). The counterterm insertion is — iX, 
where, again reading off of eq. (31.14), 

X = Ak 2 + {\C - \B)m 2 p . (31.21) 

Putting together the results of eq. (31.2) for the first diagram (with m 2 — > 
to 2 ), eq. (31.9) for the second (ditto), and eq. (31.21) for the third, we get 

n (k 2 ) = - i (A/i e )iA(0) + \g 2 F(h 2 ) — X + 0( A 2 ) 

= 3^2 m p \ + 1 + ln (M 2 Mp) 

— Ak 2 — (|0 — jB)m 2 + 0(A 2 ). (31.22) 

Again using eqs. (31.7) and (31.10) for B and 0, we see that all the 1/e’s 
cancel, and we’re left with 

If (k 2 ) = 7^2 to 2 1 + In {p 2 /m 2 ) + dx In (g 2 /D) + |(9 k c - k b ) 

+ 0( A 2 ). (31.23) 

We can now choose to work in an OS scheme, where we require II(— m, 2 ) = 0 
and n'(— m 2 ) = 0. We see that, to this order in A, n(/c 2 ) is independent of 
k 2 . Thus, we automatically have IT(— m 2 ) = 0, and we can choose 9 kc~k b 
to fix II(— m 2 ) = 0. Together with eq. (31.20), this completely determines 
k b and kq to this order in A. 

Next we consider the one-loop correction to the three-point vertex, given 
by the diagrams of fig. (31.5). We wish to show that the infinities are 
canceled by the value of Z\ = 1 + C that we have already determined. The 
first diagram in fig. (31.5) is finite, and so for our purposes we can ignore 
it. The remaining three, plus the original vertex, sum up to give 

®V 3 (fei, &2, fc 3 )div = -iZ\g 3 + \{-i\)(-ig 3 ) (i) 

x [iF(kj) + iF{kl) + iF{kl) 

+ 0(A 5/2 ) , 


(31.24) 
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Figure 31.4: 0(A) corrections to the p propagator. 



Figure 31.5: 0(A) corrections to the vertex for three p fields. 

where the subscript div means that we are keeping only the divergent part. 
Using eq. (31.9), we have 

V 3 (k 1 ,k 2 ,k 3 ) div = -g 3 (l + C-^^ + 0(\ 2 )^ . (31.25) 

From eq. (31.10), we see that the divergent terms do indeed cancel to this 
order in A. 

Finally, we have the correction to the four-point vertex. In this case, 
the divergent diagrams are just those of fig. (31.1), and so the calculation 
of the divergent part of V4 is exactly the same as it is when m 2 > 0 (but 
with m.p in place of m). Since we have already done that calculation (it 
was how we determined C in the first place), we need not repeat it. 

We have thus seen how we can compute the divergent parts of the coun- 
terterms in the simpler case of m 2 > 0, where the Z 2 symmetry is unbroken, 
and that these counterterms will also serve to cancel the divergences in the 
more complicated case of m 2 < 0, where the Z 2 symmetry is spontaneously 
broken. This a general rule for renormalizable theories with spontaneous 
symmetry breaking, regardless of the nature of the symmetry group. 

Reference Notes 

Another example of renormalization of a spontaneously broken theory is 
worked out in Peskin & Schroeder. 
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32 Spontaneous Breaking of Continuous 

Symmetries 

Prerequisite: 22, 30 

Consider the theory (introduced in section 22) of a complex scalar field (p 
with 

C = —d^ip^d^ip — m 2 ip^ip — jA^V) 2 • (32.1) 

This lagrangian is obviously invariant under the U(l) transformation 

ip(x) —> e~ ia (p(x) , (32.2) 

where a is a real number. 

Now suppose that m 2 is negative. The minimum of the potential of 
eq. (32.1) is achieved for 

¥>(®) = > (32.3) 

where 

v = (4|m 2 |/A) 1/2 , (32.4) 

and the phase 9 is arbitrary; the factor of root-two in eq. (32.3) is con- 
ventional. Thus we have a continuous family of minima of the potential, 
parameterized by 8. Under the U(l) transformation of eq. (32.2), 9 changes 
to 9 + cc; thus the different minimum-energy field configurations are all 
related to each other by the symmetry. 

In the quantum theory, we therefore expect to find a continuous family 
of ground states, labeled by 9 , with the property that 

<%MI 0} = y 2 ve- ,e . (32.5) 

Also, according to the discussion in section 30, we expect (9'\9) = 0 for 
& + 8 . 

Returning to classical language, there is a flat direction in field space 
that we can move along without changing the energy. The physical con- 
sequence of this is the existence of a massless particle called a Goldstone 
boson. 

Let us see how this works in more detail. We first choose the phase 
9 = 0 , and then write 

<p(x) = -^[v + a(x) + ib(x)\ , (32.6) 

where a and b are real scalar fields. Substituting eq. (32.6) into eq. (32.1), 
we find 

C = —^d^ad^a — ^d IJ bd IL b 

- \m 2 \a 2 - |A 1/2 |m|a(a 2 + b 2 ) - ^A (a 2 + b 2 ) 2 . (32.7) 
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We see from this that the a field has a mass given by ^m 2 = \m 2 \. The b 
held, on the other hand, is massless, and we identify it as the Goldstone 
boson. 

A different parameterization brings out the role of the massless held 
more clearly. We write 

<p(x) = ^(v + p(x))e- i * x V v , (32.8) 

where p and x are real scalar helds. Substituting eq. (32.8) into eq. (32.1), 
we get 

C = -\d Vpd^p - 3(1 + Q d IL xd ll x 

— \m 2 \p 2 — ^ A 1//2 |77T.|p 3 — jQ^p 4 ■ (32.9) 

We see from this that the p held has a mass given by ^m 2 = \m 2 \, and that 
the y held is massless. These are the same particle masses we found using 
the parameterization of eq. (32.6). This is not an accident: the particle 
masses and scattering amplitudes are independent of held redefinitions. 

Note that the x held does not appear in the potential at all. Thus it 
parameterizes the hat direction. In terms of the p and x fields, the U(l) 
transformation takes the simple form x( x ) x( x ) + a - 

Does the masslessness of the x held survive loop corrections? It does. 
To see this, we note that if the x held remains massless, its exact propagator 
A ^{k 2 ) should have a pole at k 2 = 0; equivalently, the self-energy II X (A; 2 ), 
related to the propagator by A x {k 2 ) = 1/[A: 2 — II X (A: 2 )], should satisfy 
11(0) =0. 

We can evaluate II x (0) by summing all 1PI diagrams with two external 
X lines, each with four-momentum k = 0. We note from eq. (32.9) that the 
derivatives acting on the x fields imply that the vertex factors for the pxx 
and ppxx vertices are each proportional to k\ -k‘ 2 , where k\ and k 2 are the 
momenta of the two x lines that meet at that vertex. Since the external 
lines have zero momentum, the attached vertices vanish; hence, II v (0) = 0, 
and the x particle remains massless. 

The same conclusion can be reached by considering the quantum action 
T(<£>), which includes all loop corrections. According to our discussion in 
section 29, the quantum action has the same symmetries as the classical 
action. Therefore, in the case at hand, 

r(¥>) = r (e- <£ V) . (32.10) 

Spontaneous symmetry breaking occurs if the minimum of T(<^) is at a 
constant, nonzero value of i p>. Because of eq. (32.10), the phase of this 
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constant is arbitrary. Therefore, there must be a flat direction in field 
space, corresponding to the phase of <p(x). The physical consequence of 
this flat direction is a massless particle, the Goldstone boson. 

All of this has a straightforward extension to the nonabelian case. Con- 
sider 

C. — 2^ ^Pi^n^Pi 2^ PiPi 16 ^(jpiPi) , (32.11) 

where a repeated index is summed. This lagrangian is invariant under the 
infinitesimal SO(N ) transformation 

Stpi = -i9 a {T a ) ij v j , (32.12) 

where runs from 1 to ^N(N-l), 6 a is a set of ^N(N-l) real, infinitesi- 
mal parameters, and each antisymmetric generator matrix T a has a single 
nonzero entry —i above the main diagonal, and a corresponding +i below 
the main diagonal. 

Now let us take m 2 < 0 in eq. (32.11). The minimum of the potential 
is achieved for ipflx) = ty, where v 2 = ViVi = 4|m 2 |/A, and the direction 
in which the IV-component vector v points is arbitrary. In the quantum 
theory, we interpret Vi as the vacuum expectation value (VEV for short) 
of the quantum field ipflx). We can choose our coordinate system so that 
Vi = v5iN ; that is, the VEV lies entirely in the last component. 

Now consider making an infinitesimal SO (IV) transformation. This 
changes the VEV; we have 


Vi * Vi iO a (T a )ijVj 

= v6 iN - ie a (T a ) iN v . (32.13) 

For some choices of 6 a , the second term on the right-hand side of eq. (32.13) 
vanishes. This happens if the corresponding T a has no nonzero entry in the 
last column. There are N— 1 T a ’s with a nonzero entry in the last column: 
those with the —i in the first row and last column, in the second row and 
last column, etc, down to the N— 1 th row and last column. These T a, s are 
said to be broken generators. A generator is broken if ( T a )ijVj / 0, and 
unbroken if ( T a )ijVj = 0. 

An infinitesimal SO(A^) transformation that involves a broken genera- 
tor changes the VEV of the field, but not the energy. Thus, each broken 
generator corresponds to a flat direction in field space. Each flat direction 
implies the existence of a corresponding massless particle. This is Gold- 
stone’s theorem : there is one massless Goldstone boson for each broken 
generator. 

The unbroken generators, on the other hand, do not change the VEV 
of the field. Therefore, after rewriting the lagrangian in terms of shifted 
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fields (each with zero VEV), there should still be a manifest symmetry cor- 
responding to the set of unbroken generators. In the present case, the num- 
ber of unbroken generators is ^N(N-l) — (IV— 1) = i(IV— 1)(IV— 2). This 
is the number of generators of SO(IV— 1). Therefore, we expect SO(IV— 1) 
to be an obvious symmetry of the lagrangian after it is written in terms of 
shifted fields. 

Let us see how this works in the present case. We can rewrite eq. (32.11) 
as 

£ = -±& i 'p i d li 'p i -V(<p), (32.14) 

with 

V (<p) = {(pm - v 2 ) 2 , (32.15) 

where v = (4|m 2 |/A) 1//2 , and the repeated index i is implicitly summed 
from 1 to N. Now let pn(x) = v + p(x), and plug this into eq. (32.14). 
With the repeated index i now implicitly summed from 1 to IV— 1, we have 

C = -^ipid^ipi - ^d ll pd lt p - V(p, p) , (32.16) 


where 


V(p,(p) = jq\[{v+p ) 2 + <pig>i - v 2 } 2 
= ^A(2 vp + p 2 + ipig>i ) 2 

= \\v 2 p 2 + \\vp(p 2 + <pnpi) + jq\(p 2 + <pmf ■ (32.17) 

There is indeed a manifest SO(IV— 1) symmetry in eqs. (32.16) and (32.17). 
Also, the IV— 1 (pi fields are massless: they are the expected IV— 1 Goldstone 
bosons. 


Reference Notes 

Further discussion of Goldstone’s theorem can be found in Georgi, Peskin 
& Schroeder, and Weinberg II. 

Problems 

32.1) Consider the Noether current j 11 for the U(l) symmetry of eq. (32.1), 
and the corresponding charge Q. 

a) Show that e~ ia ® tp e + ia Q = e +ia ip. 

b) Use eq. (32.5) to show that e~ ia( ^\9) = \9 + a). 

c) Show that Q|0) / 0; that is, the charge does not annihilate the 
9 = 0 vacuum. Contrast this with the case of an unbroken symmetry. 
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32.2) In problem 24.3, we showed that [</?j, Q a ] = ( T a )ijPj, where Q a is the 
Noether charge in the SO (IV) symmetric theory. Use this result to 
show that Q a |0) ^ 0 if and only if Q a is broken. 

32.3) We define the decay constant f of the Goldstone boson via 

(k\j^(x)\0) = ifk^e~ ikx , (32.18) 

where | k) is the state of a single Goldstone boson with four- momentum 
k, normalized in the usual way, |0) is the 0 = 0 vacuum, and j ll (x) is 
the Noether current. 

a) Compute / at tree level. (That is, express j 11 in terms of the p 
and x fields, and then use free field theory to compute the matrix 
element.) A nonvanishing value of / indicates that the corresponding 
current is spontaneously broken. 

b) Discuss how your result would be modified by higher-order correc- 
tions. 



Part II 

Spin One Half 
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33 Representations of the Lorentz Group 

Prerequisite: 2 

In section 2, we saw that we could define a unitary operator 17(A) that 
implemented a Lorentz transformation on a scalar field ip(x) via 

U(A)~ 1 <p(x) 17(A) = (p(A~ l x) . (33.1) 

As shown in section 2, this implies that the derivative of the field transforms 
as 

U(A)~ 1 d IJ 'ip(x)U(A) = A p p d p ip{A~ l x) , (33.2) 

where the bar on he derivative means that it is with respect to the argument 
x = A~ 1 x. 

Eq. (33.2) suggests that we could define a vector field A p (x) that would 
transform as 

17(A)" 1 A p (x)U{A) = A p p A p (A~ 1 x) , (33.3) 

or a tensor field B ,l,J (x) that would transform as 

U(A)~ 1 (x)U (A) = A P pA u a B pa (A~ 1 x) . (33.4) 

Note that if B pu is either symmetric, B pu (x) = B up, (x), or antisymmetric, 
B pu (x) = —B up (x), then the symmetry is preserved by the Lorentz trans- 
formation. Also, if we take the trace to get T(x) = gp V B pv (x), then, using 
gp U A p p A v a = g pa , we find that T(x ) transforms like a scalar field, 

U (A)~ 1 T (x)U (A) = T(A"G) . (33.5) 

Thus, given a tensor field B pu (x) with no particular symmetry, we can 
write 

B pu {x) = A pu {x) + S pu {x) + \g pv T(x) , (33.6) 

where A ^ is antisymmetric ( A p,v = —A vp ) and S pv is symmetric ( S p,v = 
S up ) and traceless {g^pS^ = 0). The key point is that the fields A pv , S pu , 
and T do not mix with each other under Lorentz transformations. 

Is it possible to further break apart these fields into still smaller sets 
that do not mix under Lorentz transformations? How do we make this 
decomposition into irreducible representations of the Lorentz group for a 
field carrying n vector indices? Are there any other kinds of indices we 
could consistently assign to a field? If so, how do these behave under a 
Lorentz transformation? 

The answers to these questions are to be found in the theory of group 
representations. Let us see how this works for the Lorentz group (in four 
spacetime dimensions) . 



33: Representations of the Lorentz Group 


212 


Consider a field (not necessarily hermitian) that carries a generic Lorentz 
index, ip A (x). Under a Lorentz transformation, we have 

U{A.)-\ a (x)U{K) = LA B (A)ip B (A~ 1 x) , (33.7) 

where L a b (A ) is a matrix that depends on A. These finite-dimensional 
matrices must obey the group composition rule 

L a b (A')L b c (A) = La C ( A'A) . (33.8) 

We say that the matrices L a b (A) form a representation of the Lorentz 
group. 

For an infinitesimal transformation A /i „ = 5 P V + we can write 

U(l+Su) = 1+ l&cvAP" , (33.9) 

where the operators M 111 ' are the generators of the Lorentz group. As 
shown in section 2, the generators obey the commutation relations 

[M /M/ , M pCT ] = i(g pp M va - (/h-h/)) - (pe^a) , (33.10) 

which specify the Lie algebra of the Lorentz group. 

We can identify the components of the angular momentum operator J as 
Ji = \>£ijkM ]k and the components of the boost operator K as K, = M i0 . 


We then find from eq. (33.10) that 


[Jit Jj\ jk^k ? 

(33.11) 

[Ji-, Kj\ — i 

(33.12) 

[Ki-, Kj\ = i^ijkJk • 

(33.13) 

For an infinitesimal transformation, we also have 


L A B (1+Su) = 5 a b + , 

(33.14) 

Eq. (33.7) then becomes 


[cp A (x),M^] = jL^Mx) + (S p ")a B Vb(x) , 

(33.15) 


where C pv = \{x p d l/ — x u d p ). Both the differential operators C pv and the 
representation matrices {S pi, ) A B must separately obey the same commuta- 
tion relations as the generators themselves; see problems 2.8 and 2.9. 

Our problem now is to find all possible sets of finite-dimensional matri- 
ces that obey eq. (33.10), or equivalently eqs. (33.11-33.13). Although the 
operators M ,w must be hermitian, the matrices (S pu ) A B need not be. 
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If we restrict our attention to eq. (33.11) alone, we know (from stan- 
dard results in the quantum mechanics of angular momentum) that we 
can find three (2j + l) x (2j+l) hermitian matrices J\ , J-j , and Ji that 
obey eq. (33.11), and that the eigenvalues of (say) J-$ are —j, —j+ 1, . . . , +j, 
where j has the possible values 0, 1, . . . . We further know that these ma- 

trices constitute all of the inequivalent, irreducible representations of the 
Lie algebra of SO(3), the rotation group in three dimensions. Inequivalent 
means not related by a unitary transformation; irreducible means cannot be 
made block-diagonal by a unitary transformation. (The standard deriva- 
tion assumes that the matrices are hermitian, but allowing nonhermitian 
matrices does not enlarge the set of solutions.) Also, when j is a half inte- 
ger, a rotation by 27 t results in an overall minus sign; these representations 
of the Lie algebra of SO (3) are therefore actually not represenations of the 
group SO(3), since a 27 t rotation should be equivalent to no rotation. As 
we saw in section 24, the Lie algebra of SO(3) is the same as the Lie algebra 
of SU(2); the half-integer representations of this Lie algebra do qualify as 
representations of the group SU(2). 

We would like to extend these conclusions to encompass the full set of 
eqs. (33.11 -33.13). In order to do so, it is helpful to define some nonher- 
mitian operators whose physical significance is obscure, but which simplify 


the commutation relations. These are 

Ni = i(Jj -iKi) , (33.16) 

N} = \{Ji+iKi) . (33.17) 

In terms of N t and Nj , eqs. (33.11-33.13) become 

Wi,Nj] = i£ijkN k , (33.18) 

[NlN}]=ie ijk Nt, (33.19) 

[N U N}] = 0. (33.20) 


We see that we have two different SU(2) Lie algebras that are exchanged 
by hermitian conjugation. As we just discussed, a representation of the 
SU(2) Lie algebra is specified by an integer or half integer; we therefore 
conclude that a representation of the Lie algebra of the Lorentz group in 
four spacetime dimensions is specified by two integers or half-integers n 
and n' . 

We will label these representations as (2ra+l, 2?r / +l); the number of 
components of a representation is then (2n+l)(2n / +l). Different compo- 
nents within a representation can also be labeled by their angular mo- 
mentum representations. To do this, we first note that, from eqs. (33.16) 
and (33.17), we have Ji = TV,; + AT. Thus, deducing the allowed values 
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of j given n and n' becomes a standard problem in the addition of an- 
gular momenta. The general result is that the allowed values of j are 
\n—n'\, \n—n'\+l , . . . , n+n', and each of these values appears exactly once. 

The four simplest and most often encountered representations are (1,1), 

(2, 1), (1,2), and (2,2). These are given special names: 

(1.1) = Scalar or singlet 

(2. 1) = Left-handed spinor 

(1.2) = Right-handed spinor 

(2.2) = Vector (33.21) 

It may seem a little surprising that (2, 2) is to be identified as the vector 
representation. To see that this must be the case, we first note that the 
vector representation is irreducible: all the components of a four-vector 
mix with each other under a general Lorentz transformation. Secondly, the 
vector representation has four components. The only candidate irreducible 
representations are (4, 1), (1,4), and (2,2). The first two of these contain 
angular momenta j = \ only, whereas (2,2) contains j = 0 and j = 1. 
This is just right for a four- vector, whose time component is a scalar under 
spatial rotations, and whose space components are a three-vector. 

In order to gain a better understanding of what it means for (2, 2) to 
be the vector representation, we must first investigate the spinor represen- 
tations (1,2) and (2,1), which contain angular momenta j = | only. 

Reference Notes 

An extended treatment of representations of the Lorentz group in four 
dimensions can be found in Weinberg I. 

Problems 


33.1) Express A ttv (x), S ,iv (x). and T(x) in terms of B l,M (x). 

33.2) Verify that eqs. (33.18-33.20) follow from eqs. (33.11-33.13). 
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34 Left- and Right-Handed Spinor Fields 

Prerequisite: 3, 33 

Consider a left-handed spinor field fi a (x). also known as a left-handed Weyl 
field , which is in the (2, 1) representation of the Lie algebra of the Lorentz 
group. Here the index a is a left-handed spinor index that takes on two 
possible values. Under a Lorentz transformation, we have 

U(A)-VaHMA) = L a b (A)M A-H) , (34.1) 

where L a b ( A) is a matrix in the (2, 1) representation. These matrices satisfy 
the group composition rule 

L a b (A')L b c (A) = L a c ( A'A) . (34.2) 


For an infinitesimal transformation A^'y = 6^ v + 5u> fl l/ , we can write 

L b {l+5u) = 8 a b + i^(5Da 6 , (34.3) 

where (S'R) a b = -{S^) a b is a set of 2 x 2 matrices that obey the same 
commutation relations as the generators M 1 ' 1 ' , namely 

^ , Sffi} = i(f>S? - (jjiwj) - (/?<— >cr) . (34.4) 

Using 

U(l+6u) = 1 + , (34.5) 

eq. (34.1) becomes 

[Mx),M^] = C^Mx) + (SDa b Mx) , (34.6) 

where C ,llJ = |(a4‘cP — x u d M ). The O aJ term in eq. (34.6) would also be 
present for a scalar field, and is not the focus of our current interest; we 
will suppress it by evaluating the fields at the spacetime origin, = 0. 
Recalling that AP- 7 = e i: > k J kj where J k is the angular momentum operator, 
we have 

e ijk [M0), Jk } = (Si j ) a b M 0) . (34.7) 

Recall that the (2, 1) representation of the Lorentz group includes an- 
gular momentum j = \ only. For a spin-one-half operator, the standard 
convention is that the matrix on the right-hand side of eq. (34.7) is ^e l ^ k Ok, 
where we have suppressed the row index a and the column index b, and 
where is a Pauli matrix: 





(34.8) 
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We therefore conclude that 


(Si j )a b = \e ijk cr k , (34.9) 

Thus, for example, setting i= 1 and j= 2 yields ( S ^ 2 )a b = ^£ 12k &k = 503 , 
and so (S^i 1 = +±, (S L 12 ) 2 2 = -±, and (S?), 2 = (5 L 12 ) 2 1 = 0 . 

Once we have the ( 2 , 1 ) representation matrices for the angular momen- 
tum operator J*, we can easily get them for the boost operator K k = M k0 . 
This is because J k = N k + nJ. and K k = i{N k — N' k ), and, acting on a 
field in the (2, 1) representation, N t is zero. Therefore, the representation 
matrices for K k are simply i times those for J k , and so 

(S k \ b = \ia k . (34.10) 

Now consider taking the hermitian conjugate of the left-handed spinor 
field fi a (x). Recall that hermitian conjugation swaps the two SU(2) Lie 
algebras that comprise the Lie algebra of the Lorentz group. Therefore, 
the hermitian conjugate of a field in the ( 2 , 1 ) representation should be a 
held in the ( 1 , 2 ) representation; such a held is called a right-handed spinor 
field or a right-handed Weyl field. We will distinguish the indices of the 
( 1 , 2 ) representation from those of the ( 2 , 1 ) representation by putting dots 
over them. Thus, we write 

. (34.11) 

Under a Lorentz transformation, we have 

t/(A)-Vt(x)U(A) = Ra b (A)fi>^ (A - 1 x) , (34.12) 

where Ra b ( A) is a matrix in the (1, 2) representation. These matrices satisfy 
the group composition rule 

Ra\A')R h d (A) = RafiA'A) . (34.13) 

For an infinitesimal transformation A ,x u = d^ u + Su^^, we can write 

Ra b (l+Scu) = Sa b + , (34.14) 

where (S^)a b = -{S^fi)a b is a set of 2 x 2 matrices that obey the same 
commutation relations as the generators A'H W . We then have 

[^(0),MH = (^ndV|(0). (34.15) 

Taking the hermitian conjugate of this equation, we get 

[m^,mo)} = [(sn^rMo) ■ 


(34.16) 
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Comparing this with eq. (34.6), we see that 

(sna b = -i(sna b r • ( 34 . 17 ) 

In the previous section, we examined the Lorentz-transformation prop- 
erties of a field carrying two vector indices. To help us get better acquainted 
with the properties of spinor indices, let us now do the same for a field that 
carries two (2, 1) indices. Call this field C ab (x). Under a Lorentz transfor- 
mation, we have 

U(A)~ 1 C ab {x)U(A) = L a c (A)L b d (A)C cd (A~ 1 x) . (34.18) 

The question we wish to address is whether or not the four components of 
C ab can be grouped into smaller sets that do not mix with each other under 
Lorentz transformations. 

To answer this question, recall from quantum mechanics that two spin- 
one-half particles can be in a state of total spin zero, or total spin one. 
Furthermore, the single spin-zero state is the unique antisymmetric com- 
bination of the two spin-one-half states, and the three spin-one states are 
the three symmetric combinations of the two spin-one-half states. We can 
write this schematically as 2 <8) 2 = 1 a © 3s , where we label the representa- 
tion of SU(2) by the number of its components, and the subscripts S and 
A indicate whether that representation appears in the symmetric or anti- 
symmetric combination of the two 2’s. For the Lorentz group, the relevant 
relation is (2, 1) <g) (2, 1) = (1, 1)a © (3, l)s- This implies that we should be 
able to write 

C ab (x) = s ab D(x) + G ab (x ) , (34.19) 

where D(x) is a scalar field, e a b = — £ b a is an antisymmetric set of constants, 
and G ab (x) = Gb a (x). The symbol e ab is uniquely determined by its sym- 
metry properties up to an overall constant; we will choose £21 = — £ 12 = +1. 

Since D(x) is a Lorentz scalar, eq. (34.19) is consistent with eq. (34.18) 
only if 

L a c (A)L b d (A)s cd = e ab . (34.20) 

This means that £ ab is an invariant symbol of the Lorentz group: it does 
not change under a Lorentz transformation that acts on all of its indices. 
In this way, e ab is analogous to the metric g /M/ , which is also an invariant 
symbol, since 

A/ A v °g pa = g„ v . (34.21) 

We use g^ iu and its inverse g^ u to raise and lower vector indices, and 
we can use e ab and and its inverse e ah to raise and lower left-handed spinor 
indices. Here we define e ab via 

£ 12 = £ 2 i = +1, £ 21 =£i2 = -l. (34.22) 
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With this definition, we have 

e ab e bc = 5 a c , e ab e bc = 5 a c . (34.23) 

We can then define 

ip a (x) ee e ab Mx) ■ (34.24) 

We also have (suppressing the spacetime argument of the field) 

'4 > a = £ab^ b = £ab£ bC ^c = , (34.25) 

as we would expect. However, the antisymmetry of e ab means that we 
must be careful with minus signs; for example, eq. (34.24) can be written 
in various ways, such as 

^ = e ab ii> b = -£ ba A = -*l) b £ ba = ^ b £ ab . (34.26) 

We must also be careful about signs when we contract indices, since 

Vxa = e ab AXa = -£ ba ^bXa = ~Ax b • (34.27) 

In section 35, we will (mercifully) develop an index-free notation that au- 
tomatically keeps track of these essential (but annoying) minus signs. 

An exactly analogous discussion applies to the second SU(2) factor; 
from the group-theoretic relation (1,2) <8> (1,2) = (1, 1)a © (l,3)s, we can 
deduce the existence of an invariant symbol e .; = — £; . . We will normalize 
£ ab according to eq. (34.22). Then eqs. (34.23-34.27) hold if all the undotted 
indices are replaced by dotted indices. 

Now consider a field carrying one undotted and one dotted index, A 0 a(x). 
Such a field is in the (2, 2) representation, and in section 33 we concluded 
that the (2, 2) representation was the vector representation. We would more 
naturally write a field in the vector representation as A^(x). There must, 
then, be a dictionary that gives us the components of A a a(x) in terms of 
the components of A ll (x)'. we can write this as 

A a a(x) = a^A^x) , (34.28) 

where cr© is another invariant symbol. That such a symbol must exist can 
be deduced from the group-theoretic relation 

(2, 1) © (1, 2) © (2, 2) = (1, 1) © . . . . (34.29) 

As we will see in section 35, it turns out to be consistent with our already 
established conventions for S^ u and to choose 


(/,<?) . 


(34.30) 
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Thus, for example, crT = +1, o'L = -1, o~L = o f = 0. 

In general, whenever the product of a set of representations includes 
the singlet, there is a corresponding invariant symbol. For example, we can 
deduce the existence of g ^ = g Ufl from 

(2, 2) © (2, 2) = (1, l) s © (1, 3 ) a © (3, 1) A © (3, 3) s • (34.31) 

Another invariant symbol, the Levi-Civita symbol , follows from 

(2, 2) © (2, 2) © (2, 2) © (2, 2) = (1, 1) A © - - - , (34.32) 

where the subscript A denotes the completely antisymmetric part. The 
Levi-Civita symbol is £p u P a , which is antisymmetric on exchange of any 
pair of its indices, and is normalized via e 0123 = +1. To see that £P u P a is 
invariant, we note that A^A^ ( gA p 7 A CT is antisymmetric on exchange 
of any two of its uncontracted indices, and therefore must be proportional 
to eP v P a . The constant of proportionality works out to be detA, which is 
+1 for a proper Lorentz transformation. 

We are finally ready to answer a question we posed at the beginning of 
section 33. There we considered a field B fJ,u (x ) carrying two vector indices, 
and we decomposed it as 

B^{x) = A^(x) + S^{x) + \g^ u T{x) , (34.33) 

where A^ v is antisymmetric {AP V = —A u p) and S^ v is symmetric = 
S Vfl ) and traceless {g^S^ = 0). We asked whether further decomposition 
into still smaller irreducible representations was possible. The answer to 
this question can be found in eq. (34.31). Obviously, T(x) corresponds to 
(1, 1), and S^ u (x) to (3, 3). 1 But, according to eq. (34.31), the antisymmet- 
ric held A^ v (x) should correspond to (3,1) © (1,3). A held in the (3,1) 
representation carries a symmetric pair of left-handed (undotted) spinor 
indices; its hermitian conjugate is a held in the (1,3) representation that 
carries a symmetric pair of right-handed (dotted) spinor indices. We should, 
then, be able to find a mapping, analogous to eq. (34.28), that gives AP u (x) 
in terms of a held G a b(x) and its hermitian conjugate G^(x). 

This mapping is provided by the generator matrices S{ a/ and S£ v . We 
hrst note that the Pauli matrices are traceless, and so eqs. (34.9) and 
(34.10) imply that (S^ u ) a a = 0. Using eq. (34.24), we can rewrite this 
as e ab (S£ v ) a b = 0. Since e ab is antisymmetric, must be symmetric 

on exchange of its two spinor indices. An identical argument shows that 

1 Note that a symmetric traceless tensor has three independent diagonal components, 
and six independent off-diagonal components, for a total of nine, the number of compo- 
nents of the (3, 3) representation. 
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(Sr v )ab rnus t be symmetric on exchange of its two spinor indices. Further- 
more, according to eqs. (34.9) and (34.10), we have 

(^°) a 6 = -i(Sl 3 ) a b . (34.34) 

This can be written covariantly with the Levi-Civita symbol as 

(SDa b = -j 2 s tlupa (S hpa )a b . (34.35) 

Similarly, 

(S£V = +y^(S Rpa )a b • (34.36) 

Eq. (34.36) follows from taking the complex conjugate of eq. (34.35) and 
using eq. (34.17). 

Now, given a field G a b(x) in the (3,1) representation, we can map it 
into a self-dual antisymmetric tensor G pL, (x) via 

G^(x) = ( Sn ab G ab (x ) . (34.37) 

By self-dual , we mean that G pu {x) obeys 

GTix) = ~^ vpa G pa {x) . (34.38) 

Taking the hermitian conjugate of eq. (34.37), and using eq. (34.17), we get 

G^(x) = -(S pu ) db Gl h (x) , (34.39) 

which is anti-self-dual , 

G^ u {x) = +y^ ptT Gl a (x) . (34.40) 

Given a hermitian antisymmetric tensor field A pv {x), we can extract its 
self-dual and anti-self-dual parts via 

G pu {x) = \A pu (x) - £ £ pup<T A pa (x) , (34.41) 

G^ u {x) = \A pu (x) + £ £ pupa A pa {x) . (34.42) 

Then we have 

A pv (x) = G pv (x) + (x) . (34.43) 

The field G pv {x) is in the (3, 1) representation, and the held G' pu (x) is in 
the (1,3) representation; these do not mix under Lorentz transformations. 

Problems 

34.1) Verify that eq. (34.6) follows from eq. (34.1). 
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34.2) Verify that eqs. (34.9) and (34.10) obey eq. (34.4). 

34.3) Show that the Levi-Civita symbol obeys 

= - SVpS^ - <5 W P « - ^-yS\5P p 

+ 5 W p a , (34.44) 

e^ pa £ a p P a = -2 Wp - S^6\) , (34.45) 

E PUpa £aupa = . (34.46) 

34.4) Consider a field C a ... C a...c(x), with N undotted indices and M dotted 
indices, that is furthermore symmetric on exchange of any pair of un- 
dotted indices, and also symmetric on exchange of any pair of dotted 
indices. Show that this field corresponds to a single irreducible rep- 
resentation (2n+l, 2n / +l) of the Lorentz group, and identify n and 
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35 Manipulating Spinor Indices 

Prerequisite: 34 

In section 34 we introduced the invariant symbols £ ab , £ ab , £■;, and £ ab , 
where 

£ 12 = e i2 = £ 21 = £ 2 i = +1 , £ 21 = £ 2i = £12 = £ j 2 = -1 • (35.1) 

We use the e symbols to raise and lower spinor indices, contracting the 
second index on the e. (If we contract the first index instead, then there is 
an extra minus sign). 

Another invariant symbol is 

al = (I,a), (35.2) 

where I is the 2x2 identity matrix, and 



are the Pauli matrices. 

Now let us consider some combinations of invariant symbols with some 
indices contracted, such as cA. This object must also be invariant. 

Then, since it carries two undotted and two dotted spinor indices, it must 
be proportional to £ a b^ab ■ Using eqs. (35.1) and (35.2), we can laboriously 
check this; it turns out to be correct. 1 The proportionality constant works 
out to be minus two: 

°aa°»bb = ~ 2e ab£ab • ( 35 - 4 ) 

Similarly, e ab £ a V(k <jL must be proportional to </‘‘ u , and the proportionality 
constant is again minus two: 

c ab ^ b <y bb = -2 sT • (35-5) 

Next, let’s see what we can learn about the generator matrices ( S £ l ') a b 
and (S'R^)a 6 from the fact that £ ab , £ db , and a 1 ! are all invariant symbols. 
Begin with 

£ ab = L(A) a c L(A) b d e cd , (35.6) 

which expresses the Lorentz invariance of £ ab . For an infinitesimal trans- 
formation A^ v = + 5 io^u, we have 

L a \l+Su ) = 5 a b + i^(sr)a & , (35.7) 

x If it did not turn out to be correct, then eq. (35.2) would not be a viable choice of 
numerical values for this symbol. 
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and eq. (35.6) becomes 

e ab = e ab + Ul pu (S Pl/ ) a C £ c b + (S PU )b d £ a d + 0(5u J 2 ) 

= e ab + — (S P )ab + (S P )ba + 0(5uj 2 ) . (35.8) 

Since eq. (35.8) holds for any choice of 5 u pi> , it must be that the factor in 

square brackets vanishes. Thus we conclude that (S pu ) a b = {S pl, )b a , which 
we had already deduced in section 34 by a different method. Similarly, 
starting from the Lorentz invariance of e^, we can show that (Sr*')^ = 

Next, start from 

<a = A p rL(A) a b R(A)a h al b , (35.9) 

which expresses the Lorentz invariance of a p -. For an infinitesimal trans- 
formation, we have 

A P T = S P T + f<Sav(Sr) p r , (35.10) 

L a b (l+Su) = ,5 a b + iS^(Sr) a b , (35.11) 

Ra\l+5i0) = 5a h + i8u lxu (Sg')a i} , (35.12) 

where 

(sn p r = \(g pp 5\ - g vp 8\) • (35.13) 

Substituting eqs. (35. 10-35. 13) into eq. (35.9) and isolating the coefficient 
of 5cj pv yields 

(« f p 8% ~ g up 8 p r)a r aa + + i(SnMb = 0 • (35.14) 

Now multiply by a pc c to get 

°cc a aa - VccVaa + *(-S T)a °ba° pec + i{S^ ) b 0^0 pcd = 0 . (35.15) 

Next use eq. (35.4) in each of the last two terms to get 

- a cc a aa + ^{SDacSdc + 2 i(S^)ac£ aC = 0 . (35.16) 

If we multiply eq. (35.16) by £ ac , and remember that £ ac {S£ u )ac = 0 and 
that £ ac £ac = ~2, we get a formula for (S pu ) ac . namely 

(ST). = ic u «,< - «) ■ 


(35.17) 
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Similarly, if we multiply eq. (35.16) by e ac , we get 

(ST). = - «) ■ (35.18) 

These formulae can be made to look a little nicer if we define 

a*** = £ ab £ dh a^ . (35.19) 

Numerically, it turns out that 

a* ha = (/, -a) . (35.20) 

Using d IJ \ we can write eqs. (35.17) and (35.18) as 

(SDa b = - o v a») a h , (35.21) 

= (35.22) 

In eq. (35.22), we have suppressed a contracted pair of undotted indices 
arranged as c c , and in eq. (35.21), we have suppressed a contracted pair of 
dotted indices arranged as c c - 

We will adopt this as a general convention: a missing pair of contracted, 
undotted indices is understood to be written as c c , and a missing pair of 
contracted, dotted indices is understood to be written as b c ■ Thus, if x and 
•0 are two left-handed Weyl fields, we have 

XV’ = X>a and xV 1 ’ = x\^ a • (35.23) 

We expect Weyl fields to describe spin-one-half particles, and (by the spin- 
statistics theorem) these particles must be fermions. Therefore the corre- 
spoding fields must anticommute, rather than commute. That is, we should 
have 

Xa{x)^ b {y) = -Mv)Xa(x) . (35.24) 

Thus we can rewrite eq. (35.23) as 

xV’ = X a ^a = -^aX a = fXa = V’X • (35.25) 

The second equality follows from anticommutation of the fields, and the 
third from switching a a to a a (which introduces an extra minus sign). 
Eq. (35.25) tells us that xV’ = Vw which is a nice feature of this notation. 
Furthermore, if we take the hermitian conjugate of xV5 we § e t 

(xVO f = (x“V’a) t = (V’a) t (x“) t = V’!x ta = V’V • (35.26) 

That (xVO f = 'y X' is J us t what we would expect if we ignored the indices 
completely. Of course, by analogy with eq. (35.25), we also have V’V = 
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In order to tell whether a spinor field is left-handed or right-handed 
when its spinor index is suppressed, we will adopt the convention that a 
right-handed field is always written as the hermitian conjugate of a left- 
handed field. Thus, a right-handed field is always written with a dagger, 
and a left-handed field is always written without a dagger. 

Let’s try computing the hermitian conjugate of something a little more 
complicated: 

V^X = 4^“ C Xc • (35.27) 

This behaves like a vector field under Lorentz transformations, 

17(A)- 1 [^a^xP(A) = A^[V’V X ] . (35.28) 

(To avoid clutter, we suppressed the spacetime argument of the fields; as 
usual, it is x on the left-hand side and A~ 1 x on the right.) The hermitian 
conjugate of eq. (35.27) is 

= [V ’i^ dc Xc] f 

= 

= Xt^a 

= yVV • (35.29) 

In the third line, we used the hermiticity of the matrices = (I, — a). 

We will get considerably more practice with this notation in the follow- 
ing sections. 


Problems 

35.1) Verify that eq. (35.20) follows from eqs. (35.2) and (35.19). Hint: 
write everything in “matrix multiplication” order, and note that, nu- 
merically, e ab = —e a b = fog • Then make use of the properties of the 
Pauli matrices. 

35.2) Verify that eq. (35.21) is consistent with eqs. (34.9) and (34.10). 

35.3) Verify that eq. (35.22) is consistent with eq. (34.17). 

35.4) Verify eq. (35.5). 
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36 Lagrangians for Spinor Fields 

Prerequisite: 22, 35 


Suppose we have a left-handed spinor field 4> a . We would like to find a 
suitable lagrangian for it. This lagrangian must be Lorentz invariant, and 
it must be hermitian. We would also like it to be quadratic in ^ and 
its hermitian conjugate because this will lead to a linear equation of 
motion, with plane-wave solutions. We want plane-wave solutions because 
these describe free particles, the starting point for a theory of interacting 
particles. 

Let us begin with terms with no derivatives. The only possibility is 
V’V’ = 'tpv-'i/ja = s^^b^a, plus its hermitian conjugate. Because of anticom- 
mutation of the fields (V’&V’a = — V’aV’fe)) this expression does not vanish (as 
it would if the fields commuted), and so we can use it as a term in C. 

Next we need a term with derivatives. The obvious choice is 
plus its hermitian conjugate. This, however, yields a hamiltonian that is 
unbounded below, which is unacceptable. To get a bounded hamiltonian, 
the kinetic term must involve both and 4- A candidate is 
This not hermitian, but 

= -id^t 
= -id^l d^a 
= i4d^ a d^ a -id^\ d^a). 

= i4 d^d^tp — id/j,(4 d^ip) ■ (36.1) 


In the third line, we used the hermiticity of the matrices d ^ = (I, — <r). In 
the fourth line, we used — (dA)B = AdB — d(AB). In the last line, the 
second term is a total divergence, and vanishes (with suitable boundary 
conditions on the fields at infinity) when we integrate it over d 4 x to get 
the action S. Thus i^d^d^y has the hermiticity properties necessary for 
a term in C. 

Our complete lagrangian for ^ is then 

C = i^d^d^y — > (36.2) 


where m is a complex parameter with dimensions of mass. The phase of m 
is actually irrelevant: if m = |m|e*“, we can set = e~ ioi l 2 y in eq. (36.2); 
then we get a lagrangian for that is identical to eq. (36.2), but with m 
replaced by \m\. So we can, without loss of generality, take m to be real 
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and positive in the first place, and that is what we will do, setting m* = m 
in eq. (36.2). 

The equation of motion for tp is then 

s s 

0 = + mip* , (36.3) 

Restoring the spinor indices, this reads 

0 = -id^ c d^ c + rmp^ . (36.4) 

Taking the hermitian conjugate (or, equivalently, computing —5S/5ip), we 
get 

0 = dyrfl + rmp a 

= +id^ a d^\ + mip a 

= -i^ac d^ C + • (36.5) 


In the second line, we used the hermiticity of the matrices d^ = (I,— a). 
In the third, we lowered the undotted index, and switched c c to c c , which 
gives an extra minus sign. 

Eqs. (36.5) and (36.4) can be combined to read 


/ mS a c —i a ac 

\ — i(T /toc d lt md a c ) V V’ tc / 


(36.6) 


We can write this more compactly by introducing the 4x4 gamma matrices 



(36.7) 


Using the sigma-matrix relations, 


Kit" + a u d fJ ') a c = -2 grS a c , 

{d^a u + d u a^f d = , (36.8) 


which are most easily derived from the numerical formulae crU = (/, o) and 
a^aa — —a), we see that the gamma matrices obey 


{r, Y} = -2gT , 


(36.9) 


where {A,B} = AB + BA denotes the anticommutator, and there is an 
understood 4x4 identity matrix on the right-hand side. We also introduce 
a four-component Majorana field 




Ipc 

V>t c 


(36.10) 
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Then eq. (36.6) becomes 


(—i-y^d^ + m)T = 0 . 


(36.11) 


This is the Dirac equation. We first encountered it in section 1, where the 
gamma matrices were given different names (/3 = 7 0 and a k = 7 ° 7 fc ). Also, 
in section 1 we were trying (and failing) to interpret f as a wave function, 
rather than as a quantum field. 

Now consider a theory of two left-handed spinor fields with an SO (2) 
symmetry, 

C = - \rmf\if\ , (36.12) 

where the spinor indices are suppressed and i = 1,2 is implicitly summed. 
As in the analogous case of two scalar fields discussed in sections 22 and 
23, this lagrangian is invariant under the SO (2) transformation 


'Yh ' 

^2. 


cos a sm a 


— sm a cos a 


,1p2. 


(36.13) 


We can write the lagrangian so that the SO(2) symmetry appears as a U(l) 
symmetry instead; let 

X = 7 j(Vh +^ 2 ) , (36.14) 

£ = TfWh -# 2 ) • (36.15) 

In terms of these fields, we have 

C = ix^^d^x + - mxZ - m^x* ■ (36.16) 

Eq. (36.16) is invariant under the U(l) version of eq. (36.13), 

X e~ ia x , 

£ -> . (36.17) 


Next, let us derive the equations of motion that we get from eq. (36.16), 
following the same procedure that ultimately led to eq. (36.6). The result 
is 


m5 a c 
—i(f IJac d,. 


- i(T ac d u\ t Xc 
m5 a r 



= 0 . 


(36.18) 


We can now define a four-component Dirac field 



T = 


(36.19) 
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which obeys the Dirac equation , eq. (36.11). (We have annoyingly used the 
same symbol T to denote both a Majorana field and a Dirac field; these are 
different objects, and so we must always announce which is meant when we 
write 'I'.) 

We can also write the lagrangian, eq. (36.16), in terms of the Dirac field 
\F, eq. (36.19). First we take the hermitian conjugate of \F to get 

= (xi , n • (36.20) 

Introduce the matrix 

( 0 <5 a A 

f3=[ . (36.21) 

W 0 ) 

Numerically, /3 = 7 0 . However, the spinor index structure of (3 and 7 0 is 
different, and so we will distinguish them. Given (3 , we define 


* = = (e, xi) • 

(36.22) 

Then we have 


^ = CXa + X k td - 

(36.23) 

Also, 


V'fdp* = e<c + xi ^ c d^Xc . 

(36.24) 


Using AdB = — (8A)B + d(AB), the first term on the right-hand side of 
eq. (36.24) can be rewritten as 


d,g = - (V)4 7 + 7) ■ (36.25) 

Then the first term on the right-hand side of eq. (36.25) can be rewritten 
as 

~m a Kc ^ = +4 . (36.26) 

Here we used anticommutation of the fields to get the first equality, and 
switched c c to c c and a “ to “ a (thus generating two minus signs) to get 
the second. Combining eqs. (36.24-36.26), we get 

+ d^a^) • (36.27) 

Therefore, up to an irrelevant total divergence, we have 

C = - mf'I' . (36.28) 

This form of the lagrangian is invariant under the U(l) transformation 

T -»• e~ ia T , 


,+*“ iTf 


( 36 . 29 ) 
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which, given eq. (36.19), is the same as eq. (36.17). The Noether current 
associated with this symmetry is 

j ,J ' = ■ (36.30) 

In quantum electrodynamics, the electromagnetic current is eVky^ik, where 
e is the charge of the electron. 

As in the case of a complex scalar field with a U(l) symmetry, there is 
an additional discrete symmetry, called charge conjugation , that enlarges 
SO(2) to 0(2). Charge conjugation simply exchanges x and £. We can 
define a unitary charge conjugation operator C that implements this, 

C~ l Xa{x)C = £ 0 (x) , 

C~ l U(x)C = Xa(x) , (36.31) 

where, for the sake of precision, we have restored the spinor index and 
spacetime argument. We then have C~ 1 £(x)C = C{x). 

To express eq. (36.31) in terms of the Dirac field, eq. (36.19), we first 
introduce the charge conjugation matrix 



(36.32) 


Next we notice that, if we take the transpose of <k, eq. (36.22), we get 

17 T = ^ j . (36.33) 

Then, if we multiply by C, we get a field that we will call T 0 , the charge 
conjugate of 'k, 

= CT t = ^ j . (36.34) 

We see that T c is the same as the original field 'k, eq. (36.19), except that 
the roles of x an d £ have been switched. We therefore have 

C ,-1 \k(x)C' = T c (x) (36.35) 

for a Dirac field. 

The charge conjugation matrix has a number of useful properties. As a 
numerical matrix, it obeys 

= -c, 


c T = c f = C- 1 


(36.36) 
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and we can also write it as 

( -£ ac 0 

C ” V 0 £ac 

A result that we will need later is 

ab 0 \ / 0 


c _ Vc = 


o ■; 


6c 


0 £ 


ab ' 


V (T^ C 0 
£ ab a ^. £ ce \ 


F -(T^ bc F 




^ce 

0 


Li 

~°ae 


(36.37) 


(36.38) 


The minus signs in the last line come from raising or lowering an index by 
contracting with the first (rather than the second) index of an e symbol. 
Comparing with 


f 0 



(36.39) 


we see that 

C"VC = -(Y*) T . (36.40) 


Now let us return to the Majorana field, eq. (36.10). It is obvious that a 
Majorana field is its own charge conjugate, that is, = T. This condition 
is analogous to the condition gfi = g> that is satisfied by a real scalar field. 
A Dirac field, with its U(l) symmetry, is analogous to a complex scalar 
field, while a Majorana field, which has no U(l) symmetry, is analogous to 
a real scalar field. 

We can write our original lagrangian for a single left-handed spinor field, 
eq. (36.2), in terms of a Majorana field, eq. (36.10), by retracing eqs. (36.20- 
36.28) with y — > V 7 aR d £ — ► if). The result is 


C = - imff . (36.41) 


However, we cannot yet derive the equation of motion from eq. (36.41) 
because it does not yet incorporate the Majorana condition \h c = 4/. To 
remedy this, we use eq. (36.36) to write the Majorana condition T = CT t 
as f = \h T C. Then we can replace T in eq. (36.41) by T T C to get 

C = . (36.42) 


The equation of motion that follows from this lagrangian is once again the 
Dirac equation. 
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We can also recover the Weyl components of a Dirac or Majorana field 
by means of a suitable projection matrix. Define 


75 = 




(36.43) 


where the subscript 5 is simply part of the traditional name of this matrix, 
rather than the value of some index. Then we can define left and right 
projection matrices 


Pr = £( 1 - 75 ) 
Pr = £(i + 75) 


Thus we have, for a Dirac field, 



P^ = 



Pr^ = 



The matrix 75 can also be expressed as 


•01 2 3 
75 = *7 7 7 7 




(36.44) 


(36.45) 


(36.46) 


where e 0 i 23 = — 1 . 

Finally, let us consider the behavior of a Dirac or Majorana field under 
a Lorentz transformation. Recall that left- and right-handed spinor fields 


transform according to 

U{ A)-Va(sMA) = L(A) a c ^c(A- 1 x) , (36.47) 

C7(A)"Vt(s)C7(A) = R(A)/V;|(A- 1 x) , (36.48) 

where, for an infinitesimal transformation = 5^ u + dio ll u , 

L(l+5uj) a c = 5 a c + ^cu^(SDa C , (36.49) 

R( 1+<M A = , (36.50) 
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and where 

(■ srv = +1 - ^^)a c , (36.51) 

(SD“o = -{{ftW - aVfe . (36.52) 

From these formulae, and the definition of 7 ^, eq. (36.7), we can see that 

/+(sr)« c 0 \ 

i[y, ^=( 0 (36 - 53) 

Then, for either a Dirac or Majorana field T, we can write 

17(A) -1 * (®)17(A) = D(A)*(A -1 ®) , (36.54) 

where, for an infinitesimal transformation, the 4x4 matrix D( A) is 

D(l+6u) = 1 + , (36.55) 

with S' 11 ' given by eq. (36.53). The minus sign in front of S£ 1 ' in eq. (36.53) 
is compensated by the switch from a c c contraction in eq. (36.50) to a c C 
contraction in eq. (36.54). 

Problems 

36.1) Using the results of problem 2.9, show that, for a rotation by an angle 
0 about the z axis, we have 

D( A) = exp(— iOS 12 ) , (36.56) 

and that, for a boost by rapidity p in the z direction, we have 

D{ A) = exp (+iijS 30 ) . (36.57) 

36.2) Verify that eq. (36.46) is consistent with eq. (36.43). 

36.3) a) Prove the Fierz identities 

(Xi^X2)(X3^mX4) = - 2 (XiX3)(X2X4) , (36.58) 

(Xi^X2)(X3^mX4) = (Xi^ m X4)(X3^X2) • (36.59) 

b) Define the Dirac fields 
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Use eqs. (36.58) and (36.59) to prove the Dirac form of the Fierz 
identities, 

(*i7^P L tf 2 )(tf 37 ^* 4 ) = -2(^ 1 P r ^)(^P l ^ 2 ) , (36.61) 

(^ l7 ^P L ^ 2 )(^ 37/i P L ^ 4 ) = (^ l7 ^P L ^ 4 )(^ 37/i P L ^ 2 ) . (3 6 . 6 2) 

c) By writing both sides out in terms of Weyl fields, show that 

®iVW 2 = , (36.63) 

= +WiP L ^{ , (36.64) 

*iPr*2 = • (36.65) 

Combining eqs. (36.63-36.65) with eqs. (36.61-36.62) yields more use- 
ful forms of the Fierz identities. 

36.4) Consider a field <pa{x) in an unspecified representation of the Lorentz 
group, indexed by A, that obeys 

U(A)~ 1 <pa(x)U(A) = L A B (A)<p B (A~ 1 x) . (36.66) 

For an infinitesimal transformation, 

L a B (1+5uj) = 5 a B + {Su^S^a 3 ■ (36.67) 

a) Following the procedure of section 22, show that the energy-momentum 
tensor is 

T> w = f_r£ - dr<p A ■ (36.68) 

dldflipA) 

b) Show that the Noether current corresponding to a Lorentz trans- 
formation is 

M pMp = x u T pp - x p T pu + B lwp , (36.69) 

where 

r)C 

B pup = -i— -(S" p )a B Pb ■ (36.70) 

d{d^ A ) 

c) Use the conservation laws d ll T pi ' = 0 and d lt A4 IJ ' l ' p = 0 to show 
that 

T up _ T pu + Q ijB av P = o . (36.71) 

d) Define the improved energy-momentum tensor or Belinfante tensor 

Q pu = T pu + i dp ( B wv _ B w» _ . (36.72) 
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Show that Q pv is symmetric: Q pv = 0' y,i . Also show that 0 /iJy is 
conserved, = 0, and that / d 3 x0° 1 ' = / d 3 xT 0u = P u , where 

P" is the energy-momentum four-vector. (In general relativity, it is 
the Belinfante tensor that couples to gravity.) 

e) Show that the improved tensor 

E pup = x v Q pp - x p Q pu (36.73) 

obeys d^E pup = 0, and that / d 3 x E 0up = f d 3 x M 0up = M vp , where 
M vp are the Lorentz generators. 

f) Compute Q pi/ for a left-handed Weyl field with C given by eq. (36.2), 
and for a Dirac field with C given by eq. (36.28). 

36.5) Symmetries of fermion fields. (Prerequisite: 24.) Consider a theory 
with N massless Weyl fields i/jj, 

C = , (36.74) 

where the repeated index j is summed. This lagrangian is clearly 
invariant under the U(./V) transformation, 

i>j Ujrfk , (36.75) 

where U is a unitary matrix. State the invariance group for the 
following cases: 

a) N Weyl fields with a common mass rn, 

C = iip^a^d^j - + 'i/’jV’j) • (36.76) 

b) N massless Majorana fields, 

C = . (36.77) 

c) N Majorana fields with a common mass rn. 

C = . (36.78) 

d) N massless Dirac fields, 

c = . (36.79) 

e) N Dirac fields with a common mass rn. 

C = . (36.80) 
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37 Canonical Quantization of Spinor Fields I 

Prerequisite: 36 

Consider a left-handed Weyl field ijj with lagrangian 

C = iift cr^d^ — \m{ ip'ip + CVC • (37.1) 

The canonically conjugate momentum to the field i^ a {x) is then 1 

a, \ - dC 

71 X ~ d{d 0 ^ a {x)) 

= i^l(x)d od - a . (37.2) 

The hamiltonian is 


H = TT a do^a ~ C 
= - C 

= — d l diip + + CVQ • 

(37.3) 

The appropriate canonical anticommutation relations are 

{V’o(x,i),^c(y,i)} = o , 

(37.4) 

{'ip a {x,t),n c (y,t)} = i5 a c <5 3 (x — y) . 

(37.5) 

Substituting in eq. (37.2) for 7 r c , we get 

{Mx,t),4(y,t)}d odc = 5 a c 5*(x-y). 

(37.6) 

Then, using cr° = <t° = /, we have 

(V’ a (x,t),'0-(y,t)} =cr°.,5 3 (x-y) , 

(37.7) 

or, equivalently, 

{^ a (x,t),#(y,0} = ^ a ^ 3 (x-y). 

(37.8) 


We can also translate this into four-component notation for either a 
Dirac or a Majorana field. A Dirac field is defined in terms of two left- 
handed Weyl fields x and £ via 



(37.9) 


1 Here we gloss over a subtlety about differentiating with respect to an anticommuting 
object; we will take up this topic in section 44, and for now simply assume that eq. (37.2) 
is correct. 
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We also define 

* = &0 = (?, X , t), (37.10) 

where 

( 0 5\\ 

d = • (37.11) 

\5 a C 0 ) 

The lagrangian is 

C = - m(y£ + X f ) 

= IT - ?nT4' . (37.12) 

The fields x and £ each obey the canonical anticommutation relations of 
eq. (37.5). This translates into 


{T Q (x,t),T /3 (y,t)} = 0 , 
{T Q (x,t),T /3 (y,t)} = (7°) q /3 < 5 3 (x — y) , 
where a and /3 are four-component spinor indices, and 

y=( 0 *). 

V o^ ac 0 ) 


(37.13) 

(37.14) 


(37.15) 


Eqs. (37.13) and (37.14) can also be derived directly from the four-component 
form of the lagrangian, eq. (37.12), by noting that the canonically conjugate 
momentum to the field 4' is dC/d(d o'?) = fTy 0 , and that ( 7 0 ) 2 = 1. 

A Majorana field is defined in terms of a single left-handed Weyl field 
V’ via 



(37.16) 


We also define 

® = *t/ 5 =( ^t). (37.17) 


A Majorana field obeys the Majorana condition 


T = T T C , 


(37.18) 


where 

°) 

^ac ) 

is the charge conjugation matrix. The lagrangian is 
C = i^o^d^ — |m('i/u/ ; + 

= i^Y'dn'F - 

= iV T C'fd li V - . 



(37.19) 


(37.20) 
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The field V’ obeys the canonical anticommutation relations of eq. (37.5). 
This translates into 

{T a (x,t),T /3 (y,f)} = (C7°) a/3 <5 3 (x — y) , (37.21) 

{T a (x, t), T (9 ( y, t)} = (7°)a/3 ^ 3 (x - y) , (37.22) 

where a and /3 are four-component spinor indices. To derive eqs. (37.21) and 
(37.22) directly from the four-component form of the lagrangian, eq. (37.20), 
requires new formalism for the quantization of constrained systems. This is 
because the canonically conjugate momentum to the field 'F is dC/d(do'&) = 
|T t C 7 °, and this is linearly related to T itself; this relation constitutes a 
constraint that must be solved before imposition of the anticommutation 
relations. In this case, solving the constraint simply returns us to the Weyl 
formalism with which we began. 

The equation of motion that follows from either eq. (37.12) or eq. (37.20) 
is the Dirac equation, 

(-ifi + m)T = 0 . (37.23) 

Here we have introduced the Feynman slash: given any four-vector a we 
define 

i = . (37.24) 

To solve the Dirac equation, we first note that if we act on it with 
i(j) + m, we get 


0 = (ift + m){— i<j) + m)^> 

= 

= (-3 2 + m 2 )T. (37.25) 

Here we have used 

H = a a a u 

= a„a„(i{^ 7 "} + i[7*.7‘']) 

= a + i[ 7 ",7l) 

= —ci^a u g^ w + 0 

= -a 2 . (37.26) 


From eq. (37.25), we see that T obeys the Klein-Gordon equation. There- 
fore, the Dirac equation has plane-wave solutions. Let us consider a specific 
solution of the form 


T(x) = u(p)e ipx + v(p)e~ ipx . 


(37.27) 
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where p° = lo = (p 2 + tb 2 ) 1 / 2 , and it(p) and u(p) are four-component 
constant spinors. Plugging eq. (37.27) into the eq. (37.23), we get 

(jf + m)u{ p)e ipx + (~y( + m)v( p)e~ ipx = 0 . (37.28) 

Thus we require 


(i> + m)u( p) = 0 , 

{-i + m)v(p) = 0 . (37.29) 


Each of these equations has two linearly independent solutions that we 
will call u±(p) and u±(p); their detailed properties will be worked out in 
the next section. The general solution of the Dirac equation can then be 
written as 


'p(x) = 53 

s=d= 


dp [MpH(p)e^ + 4(p)u s (p)e-^ 


(37.30) 


where the integration measure is as usual 


dp = 


d 3 p 

(2tt) 3 2lo 


Problems 


(37.31) 


37.1) Verify that eqs. (37.13) and (37.14) follow from eqs. (37.4) and (37.5). 
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38 Spinor Technology 

Prerequisite: 37 

The four-component spinors it s (p) and u s (p) obey the equations 

(rf + m)u s ( p) = 0 , 

(-tf + m)v s ( p) = 0 . 


(38.1) 


Each of these equations has two solutions, which we label via s = + and 
s = — . For in ^ 0, we can go to the rest frame, p = 0. We will then 
distinguish the two solutions by the eigenvalue of the spin matrix 


(38.2) 


^ = ![7 1 ,7 2 ] = |7 1 7 2 =^ o 

Specifically, we will require 

S>±(0) = ±\u-_ t(0) 
S z v±( 0) = =F±u±(0) 


(38.3) 


The reason for the opposite sign for the v spinor is that this choice results 


[j z ,bim = ±±&i(o), 

[t,4(o)] = ±±4(o), 


(38.4) 


where J z is the z component of the angular momentum operator. Eq. (38.4) 
implies that 4(0) an d 4(0) each creates a particle with spin up along the 
z axis. We will verify eq. (38.4) in problem 39.2. 

For p = 0, we have $ = —my 0 , where 


(38.5) 


Eqs. (38.1) and (38.3) are then easy to solve. Choosing (for later conve- 
nience) a specific normalization and phase for each of u±(0) and u±(0), we 
get 


u+{ 0 ) = \fm 1 


u-( 0) = \fm 


f+(0) = s/m 


u_(0) = sjrn. 


(38.6) 
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For later use we also compute the barred spinors 

u s (p) = ul(p)/3 , 
v s (p) = v\(p)P , 

where 


satisfies 


We get 


/? T = /?t = (3~ l = f3 . 

**+(' 0 ) = sfm ( 1 , 0 , 1 , 0 ) , 

U— ( 0 ) = \Jm. ( 0 , 1 , 0 , 1 ) , 

Mo) = \M (o, -i, o, i) 

*T_(0) = \/m (1, 0, — 1, 0) 


(38.7) 


(38.8) 


(38.9) 


(38.10) 


We can now find the spinors corresponding to an arbitrary three-momentum 
p by applying to u s (0) and u s (0) the matrix D( A) that corresponds to an 
appropriate boost. This is given by 

D( A) = exp(i?yp-K) , (38.11) 

where p is a unit vector in the p direction, iP = f[7 J , 7 0 ] = is 

the boost matrix, and 7 = sinh^ 1 (|p|/?n) is the rapidity (see problem 2.9). 
Thus we have 


w s (p) = exp(ir/p-K)u s (0) , 
•p(p) = exp(?'r/ p-K)u s (0) . 


(38.12) 


We also have 


u s ( p) = u s (Q) exp(— p-K) , 

v s (p) = v s (0) exp(-iryp-K) . (38.13) 

This follows from iP = iP , where for any general combination of gamma 
matrices, 

A = PA^. (38.14) 

In particular, it turns out that 

y 1 = , 

-gjlu = guv 5 

*75 = *75 , 

7 m 75 = 7 M 75 , 

= i l5 S^ . 


(38.15) 
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The barred spinors satisfy the equations 

u s (p)(tf + m) = 0 , 

v s (p)(—tf + m) = 0 . (38.16) 

It is not very hard to work out it s (p) and u s (p) from eq. (38.12), but 
it is even easier to use various tricks that will sidestep any need for the 
explicit formulae. Consider, for example, u s >(p)u s (p); from eqs. (38.12) 
and (38.13), we see that u s /(p)u s (p) = u s >(0)u s (0), and this is easy to 
compute from eqs. (38.6) and (38.10). We find 

u s '(p)u s (p) = +2m 5 S ' S , 

v s '(p)v s (p) = -2m6 s > s , 
u s >( p)v s (p) = 0 , 

Vs'{p)u s (p) = 0 . (38.17) 

Also useful are the Gordon identities , 

2m u s t (p') 7 /i u s (p) = uy(p') {p' +pY - 2 \iS^[p' - p) v u s ( p) , 
-2mv s >(p)^v s (p) = v 8 >(p') (p' +pY - 2 iS^(p' - p) u v s (p) . (38.18) 

To derive them, start with 

W = HV*. + kb 1 **#] = -P^ ~ ZiS^Pv , (38.19) 

/r = HWl ~ WlYY] = ~P ,fl + 2 iS^p' v . (38.20) 

Add eqs. (38.19) and (38.20), sandwich them between u' and u spinors (or 
v' and v spinors), and use eqs. (38.1) and (38.16). An important special 
case is p' = p; then, using eq. (38.17), we find 

u s /(p)7 M u s (p) = 2 pP5 s ' s , 

u s /(p)7 M u s (p) = 2 pP8 a > s . (38.21) 

With a little more effort, we can also show 

u s >(p)y°v s (-p) = 0 , 

u s /(p)7°u s (-p) = 0 . (38.22) 

We will need eqs. (38.21) and (38.22) in the next section. 

Consider now the spin sums X) s =± u s(p)u s (p) and X) s =± v s(p)^s(p)j 
each of which is a 4 x 4 matrix. The sum over eigenstates of S z should 
remove any memory of the spin-quantization axis, and so the result should 
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be expressible in terms of the four- vector p^ and various gamma matrices, 
with all vector indices contracted. In the rest frame, if = —my 0 , and it is 
easy to check that ^ s =± u s (0)7t<j(0) = my° + m and Yf s =±v s (0)v s {0) = 
my° — m. We therefore conclude that 

Y u s (p)u s {p) = -rf + m, 

S=d= 

Y Vs(p)v 8 (p) = ~lf ~ rn . (38.23) 

S=d= 

We will make extensive use of eq. (38.23) when we calculate scattering cross 
sections for spin-one-half particles. 

From eq. (38.23), we can get u + (p)u + (p) , etc, by applying appropriate 
spin projection matrices. In the rest frame, we have 

|(1 + 2 s 5 2 ) u s /( 0 ) = S ss / u s >( 0 ) , 

|(1 - 2sS z )v s ,(0) = S ss / ty ( 0 ) . (38.24) 

In order to boost these projection matrices to a more general frame, we 
first recall that 

75 = iy°y 1 y 2 y 3 = ^ Q ^ ■ (38.25) 

This allows us to write S z = \y x y 2 as S z = — \y§y 3 y 0 - In the rest frame, 
we can write 7 0 as — jf/m , and y 3 as f, where z ^ = (0, z); thus we have 

s z = 2^75# • (38.26) 

Now we can boost S z to any other frame simply by replacing f and if with 
their values in that frame. (Note that, in any frame, z* 1 satisfies z 2 = 1 and 
z-p = 0.) Boosting eq. (38.24) then yields 

5(1 - Sy 5 f)u s '( p) = S ss r U s >( p) , 

1(1 - sy 5 f)vy(p) = S sg/ ty(p) , (38.27) 

where we have used eq. (38.1) to eliminate ]f. Combining eqs. (38.23) and 
(38.27) we get 


U s (p)u s (p) = 1(1 - S 75 ^)(-^ + 7n) , 

u s (p)u s (p) = 1(1 - sy 5 f)(-]f- m) . (38.28) 

It is interesting to consider the extreme relativistic limit of this formula. 
Let us take the three-momentum to be in the 2 direction, so that it is 
parallel to the spin-quantization axis. The component of the spin in the 
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direction of the three-momentum is called the helicity. A fermion with 
helicity + 1/2 is said to be right-handed , and a fermion with helicity — 1/2 
is said to be left-handed. For rapidity 77, we have 

= (cosh 77, 0 , 0 , sinh 77) , 

z ^ = (sinh 77 , 0,0, cosh 77 ) . (38.29) 

The first equation is simply the definition of 77, and the second follows from 
z 2 = 1 and p-z = 0 (along with the knowledge that a boost of a four-vector 
in the z direction does not change its x and y components). In the limit of 
large 77 , we see that 

= ±p» + 0(e“’ J ) . (38.30) 

Hence, in eq. (38.28), we can replace / with ]f/m, and then use the matrix 
relation (]f/m)(—]f ± m) = +(— 7 / ± 771), which holds for p 2 = —m 2 . For 
consistency, we should then also drop the m relative to 7 /, since it is down 
by a factor of 0(e~ v ). We get 

u s (p)u s (p) -»• ±(1 + sq 5 ) (- 7 /) , 

v s (p)ws(p) -> 5(1 - S 75 X- 7 O • (38.31) 

The spinor corresponding to a right-handed fermion (helicity +1/2) is 
77+ (p) for a 6 -type particle and u_(p) for a d-type particle. According 
to eq. (38.31), either of these is projected by /(I + 75) = diag(0, 0, 1, 1) 
onto the lower two components only. In terms of the Dirac field T(x), this 
is the part that corresponds to the right-handed Weyl field. Similarly, left- 
handed fermions are projected (in the extreme relativistic limit) onto the 
upper two spinor components only, corresponding to the left-handed Weyl 
field. 

The case of a massless particle follows from the extreme relativistic limit 
of a massive particle. In particular, eqs. (38.1), (38.16), (38.17), (38.21), 
(38.22), and (38.23) are all valid with m = 0, and eq. (38.31) becomes 
exact. 

Finally, for our discussion of parity, time reversal, and charge conjuga- 
tion in section 40, we will need a number of relationships among the u and 
v spinors. First, note that /3u s (0) = +tt s (0) and (3v s ( 0) = — u s (0). Also, 
(3K- 3 = We then have 

u s (-p) = +(3u s { p) , 

Vs(-p) = -fiv s ( p) . (38.32) 

Next, we need the charge conjugation matrix 

/ 0 -1 0 0 \ 

c = +1 0 0 0 

0 0 0 +1 

Vo 0 - 10 / 


(38.33) 
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which obeys 

C T = c f = C- 1 = -C , (38.34) 

pC = -CP , (38.35) 

C - Vc = -in^Y ■ (38.36) 

Using eqs. (38.6), (38.10), and (38.33), we can show that Cu s (0) T = u s (0) 
and CU S (0) T = it s (0). Also, eq. (38.36) implies C~ l K^C = —(K 3 ) T . From 
this we can conclude that 

Cu s { p) T = v s (p) , 

Cu s (p) T = u s (p) . (38.37) 

Taking the complex conjugate of eq. (38.37), and using U T * =v)= Pu , we 
get 

Kip) = C-Pv s { P) , 

u*(p) = CPu s (p) . (38.38) 

Next, note that 75m, (0) = +su_ s (0) and 75 (0) = -s«_ s (0), and that 
75 '75. Therefore 

75«s(p) = +su_ s (p) , 

75^s(p) = -su- s { p) . (38.39) 

Combining eqs. (38.32), (38.38), and (38.39) results in 

u*s(- P) = -sC7 5 u s (p) , 

u* s (-p) = -sC75Us(p) . (38.40) 

We will need eq. (38.32) in our discussion of parity, eq. (38.37) in our dis- 
cussion of charge conjugation, and eq. (38.40) in our discussion of time 
reversal. 

Problems 

38.1) Use eq. (38.12) to compute u s (p) and u s (p) explicity. Hint: show 
that the matrix 2ip-K has eigenvalues ±1, and that, for any matrix 
A with eigenvalues ±1, e cA = (coshc) + (sinh c)A. where c is an 
arbitrary complex number. 

38.2) Verify eq. (38.15). 

38.3) Verify eq. (38.22). 

38.4) Derive the Gordon identities 

Iv(p') ip' +pY - 2 iS^ip' -p) v 75«s(p) = 0 , 

*v(pO iP +PY ~ 2 iS^ip -p) v 75^ (p) = 0 . (38.41) 



39: Canonical Quantization of Spinor Fields II 


246 


39 Canonical Quantization of Spinor Fields 


II 

Prerequisite: 38 
A Dirac field 4/ with lagrangian 

C = - mW (39.1) 

obeys the canonical anticommutation relations 

I'M*, t), FFy, *)} = 0 , (39.2) 

{T q (x, t), Vpi y, t)} = (7°) Q /3 <5 3 (x - y) , (39.3) 

and has the Dirac equation 

{—ify + m)\h = 0 (39.4) 

as its equation of motion. The general solution is 

®(*) = [ d P M P)u s (p)e ipx + 4{p)v s (p)e~ ipx ] , (39.5) 

s=± J 


where 6 s (p) and d|(p) are operators; the properties of the four-component 
spinors it s (p) and u s (p) were belabored in the previous section. 

Let us express 6 s (p) and d\(p) in terms of T(x) and T(x). We begin 
with 

J d 3 x e~ ipx ^(x) = [^jMp)Mp) + ^e 2tut d\,(-p)v s >(-p) . 

s ' =± (39.6) 

Next, multiply on the left by u s (p)7°, and use m s (p)7 0 ^v(p) = 2 uS ss i and 
Us(p)l°v s '(-p) = 0 from section 38. The result is 

b s (p) = J d 3 x e~ ipx u s ( p)7°T(x) . (39.7) 

Note that b s (p) is time independent. 

To get b\( p), take the hermitian conjugate of eq. (39.7), using 

7L j (p)7°'F ;r ) f = F s (p)7°'J'C) 

= T(x)7°u s (p) 

= T(x)7°tt s (p) , (39.8) 

where, for any general combination of gamma matrices A, 

A = j9A f /3 . 


(39.9) 
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Thus we find 

&t(p) = J d 3 x e ipx \&(x)7°u s (p) . (39.10) 

To extract dj(p) from ’I'(x), we start with 

f d 3 xe wx ^(x)= [^ e ~ 2 “^M-p)*v(-p) + ^4'(P)^'(P) • 

J s’=± 

(39.11) 

Next, multiply on the left by u s (p)7°, and use U s (p)7°*v(p) = 2 uS ss i and 
U s (p)7°rv(-p) = 0 from section 38. The result is 

dj(p) = J d 3 x e ipx v s (p)-f°^{x) . (39.12) 

To get d s (p), take the hermitian conjugate of eq. (39.12), which yields 

d s ( p) = J d 3 x e~ ipx ^{x)Av s {p) . (39.13) 

Next, let us work out the anticommutation relations of the b and d op- 
erators (and their hermitian conjugates). From eq. (39.2), it is immediately 
clear that 

{Mp),Mp')I = 0 7 

{d s (p),d s >(p')} = 0 , 

{MpMI'CpO} = 0 , (39.14) 

because these involve only the anticommutator of T with itself, and this 
vanishes. Of course, hermitian conjugation also yields 

{^(P)> fc !'(P')} = 0 7 

{4(p),4(p')} = 0 , 

{bt(p),d s >(p')} = 0 . (39.15) 

Now consider 

{Mp),^(p')} = j d 3 xd 3 ye- ipx+ip 'yu s {p)A{^{x)^{y)WuAp') 

= J d 3 xe- i(p - p ' )x u s {pAA°l°u s '{p') 

= (27r) 3 5 3 (p - p , )u s (p)7°^'(p) 

= (27r) 3 <l 3 (p — p') 2co5 ss > . (39.16) 

In the first line, we are free to set x° = y° because 6 s (p) and b\, (p 7 ) are 
actually time independent. In the third, we used (7 0 ) 2 = 1, and in the 
fourth, u s (p)7°u s /(p) = 2u5 ss '. 
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Similarly, 

{4( P ),4'(P')} = / d 3 xd 3 ye^-^^ s (p) 7 °{^(a;),^(y)}7 0 ^(p / ) 

= J d 3 xe^ p - p ' )x v s ( p)7 0 7 0 7°^(p / ) 

= (2vr) 3 (5 3 (p - P , )^s(p)7°^ , (p) 

= (27r) 3 (5 3 (p — p ; ) 2u5 ss ' . (39.17) 

And finally, 

{Mp),4'(p')} = / d 3 xd 3 ye-^-^lZ s (p)7 0 {^(x),^(y)}7 0 ^(p / ) 

= j d 3 xe~ i{p+p ' )x u s ( P)7 0 7 0 7°^'(P / ) 

= (2tt) 3 (5 3 (p + p , ) : u s (p)7 0 ^'(-p) 

= 0 . (39.18) 

According to the discussion in section 3, eqs. (39.14-39.18) are exactly what 
we need to describe the creation and annihilation of fermions. In this case, 
we have two different kinds: 6-type and cl- type, each with two possible spin 
states, s = + and s = — . 

Next, let us evaluate the hamiltonian 

H = J d 3 x \H(— irfdi + m)^f (39.19) 

in terms of the b and d operators. We have 

(-i'fdi +m)^ = J d'P ( - 'h l d % + m) ( b s ( p ) u s ( p ) e rpx 

+ 4(p)v s (p)e- ipx ) 

= [ d P b s (p)(+fpi + m)u s (p)e ipx 

S=d= 

+ 4(p)(-7Vi + m)v s {p)e~ ipx 

= J d P b s (p)(j°u)u s (p)e ipx 

+ di(p)(- 1 °u)v s (p)e- ipx }. (39.20) 

Therefore 

H = J d P dp' d 3 x {b\,(p)u s '(p')e~ ip ' x + d s >( p')v s '(p')e ip ' x ^ 
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x u; (6 S ( p) 7 V(p)e^ - 4(p) 7 °^(p)e-^) 

= Y,J d P d P' d3x u 4'(p')Mp) tt s '(p , )7°^s(p) e~ dp '^ x 

- &1(p')4(p) Mp'bV(p) e-W^ x 

+ d s /(p')Mp) <v(p , )7°«s(p) e +dp ’ +p)x 

- d s ,(p')4(p) «»'(p')7%(p) e +dp, ~ p >' 
dp \ bl,(p)b s (p) u s '(p)7V(p) 

-bi,(-p)di(p)uA-ph°v s (p) e +2iujt 
+ d s '(-p)b s (p ) v s /(-p) 7 °ii s (p) e~ 2lujt 
~ d s '(p)dl(p) v s '(p)7°v s (p) 

= J2f d P UJ 4(p)Mp) -4(p)4(p) • (39.21) 

S J 

Using eq. (39.17), we can rewrite this as 

H = f dp u 4(p)Mp) +dl(p)d s (p) -4 £ 0 V , (39.22) 



where So = \{2i r) -3 / d 3 k ui is the zero-point energy per unit volume that 
we found for a real scalar held in section 3, and V = (27r) 3 d 3 (0) = / d 3 x 
is the volume of space. That the zero-point energy is negative rather than 
positive is characteristic of fermions; that it is larger in magnitude by a 
factor of four is due to the four types of particles that are associated with a 
Dirac held. We can cancel off this constant energy by including a constant 
term Do = — 4£o in the original lagrangian density; from here on, we will 
assume that this has been done. 

The ground state of the hamiltonian (39.22) is the vacuum state |0) 
that is annihilated by every b s ( p) and d s (p), 


b s ( P)|0> = 4(p)|0) = 0 . (39.23) 

Then, we can interpret the 6|(p) operator as creating a 6-type particle 
with momentum p, energy lo = (p 2 + m 2 ) 1 / 2 , and spin S z = and the 
4(p) operator as creating a d-type particle with the same properties. The 
6-type and d-type particles are distinguished by the value of the charge 
Q = / d 3 xj°, where j 1 ' = 4' 7 /i 4' is the Noether current associated with the 
invariance of C under the U(l) transformation T — > e~ ia 4 ! , T — > e +lQ T. 
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Following the same procedure that we used for the hamiltonian, we can 
show that 

Q = f d 3 x T 7 °T 

= [ d P bt(p)b s (p) + d s (p)dt(p) 

s=± J 

= ^ f dp 6j(p)6 s (p) — dl(p)d s (p) + constant . (39.24) 

s=±d 

Thus the conserved charge Q counts the total number of 6-type particles 
minus the total number of d-type particles. (We are free to shift the overall 
value of Q to remove the constant term, and so we shall.) In quantum 
electrodynamics, we will identify the 6-type particles as electrons and the 
d-type particles as positrons. 

Now consider a Majorana field 'h with lagrangian 

C = - ±mT T CT . (39.25) 

The equation of motion for 'h is once again the Dirac equation, and so the 
general solution is once again given by eq. (39.5). However, T must also 
obey the Majorana condition 'h = CT t . Starting from the barred form of 
eq. (39.5), 

^( x ) = X! [dp bl(p)u s (p)e~ vpx + d s (p)v s (p)e tpx , (39.26) 

s=± J 

we have 

CT t (x) = [dp b\(p)Cu T s {p)e~ tpx + d s (p) Cv T s (p)e ipx . (39.27) 

s=± J 

From section 38, we have 

Cn s (p) T = u s (p) , 


Cn s (p) T = u s (p) , (39.28) 

and so 

CT t (x) = [dp b\{p)v s (p)e~ wx + d s (p)u s (p)e %px . (39.29) 

s=± J 

Comparing eqs. (39.5) and (39.29), we see that we will have 'h = CT t if 

d s ( p) = 6 s (p) . (39.30) 
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Thus a free Majorana field can be written as 

^(z) = [ d P b s (p)u s (p)e ipx + bl{p)v s {p)e~ ipx . (39.31) 

.s=± ■' 

The anticommutation relations for a Majorana field, 

{T a (x, t ), T /3 (y, t)} = (C7°)a/3 <5 3 ( x - y) , (39.32) 

{T a (x, t), ^a(y, f)} = (7 °)q /3 ^ 3 ( x - y) >• (39.33) 

can be used to show that 

{Mp)A'(p')} = 0 > 

{6 s (p),6^,(p')} = (27r) 3 <5 3 (p — p r ) 2lo 5 ss / , (39.34) 

as we would expect. 

The hamiltonian for the Majorana field is 

H = l j d 3 x + m)T 

= \ J d 3 x \H (— irfdi + m)'I> , (39.35) 


and we can work through the same manipulations that led to eq. (39.21); the 
only differences are an extra overall factor of one-half, and d s (p) = b s ( p). 
Thus we get 

H = \Yh [ d P u bl(p)b s (p) - 6 s (p)6j(p) . (39.36) 

s=± J 

Note that this would reduce to a constant if we tried to use commutators 
rather than anticommutators in eq. (39.34), a reflection of the spin-statistics 
theorem. Using eq. (39.34) as it is, we find 

H = ^2 [dp vbl(p)b s (p) - 2 £qV. (39.37) 

s=± J 

Again, we can (and will) cancel off the zero-point energy by including a 
term Qq = —2£q in the original lagrangian density. 

The Majorana lagrangian has no U(l) symmetry. Thus there is no 
associated charge, and only one kind of particle (with two possible spin 
states). 


Problems 
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39.1) Verify eq. (39.24). 

39.2) Use = —i{x p, d v — x u <9 M )\I/ (x) + S [W <]/(&■), plus whatever 

spinor identities you need, to show that 

Jzbl(pz) |0) = ±sbl(pz)\0) , 

Jzdl(pz) |0) = \sd\(jpz) |0) , (39.38) 

where p = pz is the three-momentum, and z is a unit vector in the z 
direction. 

39.3) Show that 

(7(A)->6t(p)(7(A) = 6t(A-!p) , 

(7(A)-‘4(p)(7(A) = 4(A-‘p) , (39.39) 

and hence that 

U(A)\p,s,q) = \Ap,s,q) , (39.40) 

where 

\p,s,+) = 6J(P)I°) > 

|p,s,-) = 4(P)|0) (39.41) 


are single-particle states. 

39.4) The spin- statistics theorem for spin- one-half particles. We will follow 
the proof for spin-zero particles in section 4. We start with b s { p) 
and b\{ p) as the fundamental objects; we take them to have either 
commutation (— ) or anticommutation (+) relations of the form 


[Mp)>Mp)] t = 0 > 

[&J(p)>&I'(p')]:f = 0 > 

[b s { p),&Up / )]q= = (2vr) 3 2cu(5 3 (p - p')6 ss > . 

Define 


^ + (^) = I dpb s (p)u s (p)e ipx , 

s=± J 



(39.42) 


(39.43) 


a) Show that {7(A) 1 4 ' ± (.t)C/(A) = D(A)4' ± (A l x). 
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b) Show that [T + ( x )]* = [*- (x)] T C/3. Thus a hermitian interaction 
term in the lagrangian must involve both T + (x) and T~(x). 

c) Show that [T+(x), Vl/^(r/)]=p / 0 for (x — y ) 2 > 0. 

d) Show that [*+(x), ^(y)] T = -[^(y),^~(x)}-_ F for (x - y) 2 > 0. 

e) Consider 'h(x) = T + (x) + A'I'~(x), where A is an arbitrary complex 
number, and evaluate both [U/ Q (x),yp(y)h and [®a(*),®/3(y)] q= for 
(x — y ) 2 > 0. Show these can both vanish if and only if |A| = 1 and 
we use anticommutators. 
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40 Parity, Time Reversal, and Charge 

Conjugation 

Prerequisite: 23, 39 


Recall that, under a Lorentz transformation A implemented by the unitary 
operator 17(A), a Dirac (or Majorana) field transforms as 

U(A) _1 T(x)L r (A) = D(A)T(A” 1 x) . (40.1) 


For an infinitesimal transformation A M V = & L V + 6uj^ v , the matrix D( A) is 
given by 

D(l+Su) = 1+ , (40.2) 

where the Lorentz generator matrices are 


S'" = • 


(40.3) 


In this section, we will consider the two Lorentz transformations that cannot 
be reached via a sequence of infinitesimal transformations away from the 
identity: parity and time reversal. We begin with parity. 

Define the parity transformation 


PC = (P _1 )C 


f +1 \ 

-1 

-1 

V -1 / 


(40.4) 


and the corresponding unitary operator 


P = U(V) . 


(40.5) 


Now we have 

P _1 ®(x)P = D(V)^(Vx) . (40.6) 

The question we wish to answer is, what is the matrix D(V)1 
First of all, if we make a second parity transformation, we get 

P“ 2 ®(x)P 2 = D(P) 2 T(x) , (40.7) 

and it is tempting to conclude that we should have D(V) 2 = 1, so that we 
return to the original field. This is correct for scalar fields, since they are 
themselves observable. With fermions, however, it takes an even number 
of fields to construct an observable. Therefore we need only require the 
weaker condition D(V) 2 = ±1. 
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We will also require the particle creation and annihilation operators to 
transform in a simple way. Because 

P~ l PP = -P , (40.8) 

P~ l iP = +J , (40.9) 

where P is the total three-momentum operator and J is the total angu- 
lar momentum operator, a parity transformation should reverse the three- 
momentum while leaving the spin direction unchanged. We therefore re- 
quire 

P^bl{p)P = rjbl(-p) , 

P- 1 4( P )P = r/4(-p) , (40.10) 

where is a possible phase factor that (by the previous argument about 
observables) should satisfy r] 2 = ±1. We could in principle assign different 
phase factors to the b and d operators, but we choose them to be the same so 
that the parity transformation is compatible with the Majorana condition 
d s ( p) = b ( j(p). Writing the mode expansion of the free field 

*(z) = E [ & + 4(p)u s (p)e-H , (40.11) 

s=± J 

the parity transformation reads 
P~ 1 4'(x)P 

= E /^[(^- 1 fc s (p)P)^(p)e^ + (p- 1 4(p)P)^( P )e- W 

s=± J 

= E [^p[v*U-p)MPy px +vdi(-p)v s (p)e~ i P x 

s=± J 

= E [ d P r fbs{p)u s {-p)(i lpVx + ridl(p)v s (-p)e~ vpVx . (40.12) 

s=± J 

In the last line, we have changed the integration variable from p to — p. 
We now use a result from section 38, namely that 

u s (-p) = +(3u s { p) , 

v s (-p) = -fiv s (p) , (40.13) 

where 

'-c :)• <4oi4) 
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Then, if we choose rj = —i, eq. (40.12) becomes 


P~ 1 ^(x)P = 

S=d= 


dp \ib s {p)(3u s (p)e wVx 


+ idl(p)(3v s (p)e- ipVx 


= (Vx) . 


(40.15) 


Thus we see that D(V) = i(3 . (We could also have chosen r] = i, resulting 
in D(V) = — i/3 ; either choice is acceptable.) 

The factor of i has an interesting physical consequence. Consider a 
state of an electron and positron with zero center-of-mass momentum, 

\4>) = f dp 0(p)6j(p)4/(-p)|O) ; (40.16) 


here 4>(p) is the momentum-space wave function. Let us assume that the 
vacuum is parity invariant: P|0) = P _1 |0) = |0). Let us also assume that 
the wave function has definite parity: </>(— p) = (— )^c/>(p). Then, applying 
the inverse parity operator on |0), we get 

p- l \<t>) = J dp</>(p)(p- 1 6t(p)p)(p- i 4(- P )p)p- i |o) . 

= H) 2 / p) & I(-p)4'(p)I°> 

= H) 2 / d P<K-p) 6 i(p)4'(-p)l°> 

= -(-)V> • (40.17) 


Thus, the parity of this state is opposite to that of its wave function; an 
electron-positron pair has an intrinsic parity of —1. This also applies to a 
pair of Majorana fermions. This influences the selection rules for fermion 
pair annihilation in theories that conserve parity. (A pair of electrons also 
has negative intrinsic parity, but this is less interesting because the electrons 
are prevented from annihilating by charge conservation.) 

Let us see what eq. (40.15) implies for the two Weyl fields that comprise 
the Dirac field. Recalling that 



we see from eqs. (40.14) and (40.15) that 

P^Xa(x)P = ig & (Vx) , 
P~ 1 ^ a (x)P = i X a(Vx) . 


(40.18) 


(40.19) 
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Thus a parity transformation exchanges a left-handed field for a right- 
handed one. 

If we take the hermitian conjugate of eq. (40.19), then raise the index on 
one side while lowering it on the other (and remember that this introduces 
a relative minus sign!), we get 

P~ 1 X ^(x)P = itj a (Vx) , 


P~ 1 Ux)P = ix U {Vx) . 

(40.20) 

Comparing eqs. (40.19) and (40.20), we see that they are compatible with 
the Majorana condition Xa{x) = £, a (x). 

Next we take up time reversal. Define the time-reversal transformation 

/- 1 N 

= +1 +1 
V +i) 

(40.21) 

and the corresponding operator 


T = U(T) . 

(40.22) 

Now we have 

T" 1 T(x)T = D(T)V(Tx) . 

(40.23) 


The question we wish to answer is, what is the matrix D(T)? 

As with parity, we can conclude that D(T) 2 = ±1, and we will require 
the particle creation and annihilation operators to transform in a simple 
way. Because 


T _1 PT = -P , (40.24) 

T _1 JT = -J , (40.25) 

where P is the total three-momentum operator and J is the total angu- 
lar momentum operator, a time-reversal transformation should reverse the 
direction of both the three-momentum and the spin. We therefore require 

T- 1 6t( p )T = C s 6 t _ s (-p), 

T-^pXT = ( s dl s (-p) • (40.26) 

This time we allow for possible -s-dependence of the phase factor. Also, 
we recall from section 23 that T must be an antiunitary operator, so that 
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T 1 iT = —i. Then we have 


T~ i T(x)T 


E / dp[(T- 1 6 s (p)T)<(p)e- < ^ + (r- 1 4( p)T)<(p)e^ 


s=d= ' 


= E / ^[C6-,(-pK(p)e-^ + C^ t _,(-pK(p)e^ 


s=d= ' 


= E / #k-^ s (p)^- s (-p)e^ + C-s4(p)^,(-p)e- ?pTs 


S=± ' 


(40.27) 


In the last line, we have changed the integration variable from p to — p, and 
the summation variable from s to — s. We now use a result from section 
38, namely that 


u*- s (-p) = -sCj 5 u s (p) , 

v*L s (-p) = -sC 7 5 u s (p) . (40.28) 

Then, if we choose Cs = s, eq. (40.27) becomes 

T~ 1 ^{x)T = C 75 T(Tx) . (40.29) 


Thus we see that D(T) = C75. (We could also have chosen ( s = —s, 
resulting in D(T) = —C 75; either choice is acceptable.) 

As with parity, we can consider the effect of time reversal on the Weyl 
fields. Using eqs. (40.18), (40.29), 


and 


we see that 



T~\a{x)T = +x a (Tx) , 
r- 1 e t “(x)T = -ei(Tx) . 


(40.30) 


(40.31) 


(40.32) 


Thus left-handed Weyl fields transform into left-handed Weyl fields (and 
right-handed into right-handed) under time reversal. 
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If we take the hermitian conjugate of eq. (40.32), then raise the index on 
one side while lowering it on the other (and remember that this introduces 
a relative minus sign!), we get 

T- 1 X td (x)T = -xl(Tx) , 

T-^aOT = +£ a (Tx) . (40.33) 

Comparing eqs. (40.32) and (40.33), we see that they are compatible with 
the Majorana condition Xa(x) = £ a (x)- 

It is interesting and important to evaluate the transformation properties 
of fermion bilinears of the form 'PT'F, where A is some combination of 
gamma matrices. We will consider ^4’s that satisfy A = A, where A = 
pAi/3; in this case, \F.A\F is hermitian. 

Let us begin with parity transformations. From T = 4+/3 and eq. (40.15) 
we get 

P _1 tf (®)P = -m{Vx)(3 , (40.34) 

Combining eqs. (40.15) and (40.34) we find 

P” 1 (lC4T)p = H(pAp) T , (40.35) 

where we have suppressed the spacetime arguments (which transform in 
the obvious way). For various particular choices of A we have 

PIP = +1 , 

PiUsP = -ils , 

Pl°P = +7° , 

PYP = -7* , 

/?7°75/3 = ~7°75 , 

PY'lbP = +7*75 • (40.36) 

Therefore, the corresponding hermitian bilinears transform as 

P _1 (^)P = +TT , 
p _1 ^*7 5 ^p = , 

P 1 ("+7^75 + P = -P m ^7^7 5 T , (40.37) 

Thus we see that 'F’F and 'I'y^’F are even under a parity transformation, 
while 'Frys'F and are odd. We say that TT is a scalar, 4 / 7 ^'F 
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is a vector or polar vector, TrysT is a pseudoscalar, and ^7^75^ is a 
pseudovector or axial vector. 

Turning to time reversal, from eq. (40.29) we get 

T“ 1 T(x)T = T(Tx) 75C- 1 . (40.38) 

Combining eqs. (40.29) and (40.38), along with T~ 1 AT = A*, we find 

T _1 (TAT)r = t( 7 5 C~U*C7 5 )t , (40.39) 

where we have suppressed the spacetime arguments (which transform in the 
obvious way). Recall that C _1 7 M C = — (7^) T and that C~ l ^C = 75. Also, 
7 0 and 75 are real, hermitian, and square to one, while 7* is antihermitian. 
Finally, 75 anticommutes with 7^. Using all of this info, we find 

7 5 CT 1 1*C75 = +1 , 

7 5 £U 1 (i7 5 )*C75 = -z7 5 , 

75C _1 (7°)*C75 = +7° r 

75 c“ 1 (7yc75 = , 

7 5 £U 1 (7 0 7 5 )*C75 = +7°75 , 

7 5 C _1 (7*75)^75 = -7*75 • (40.40) 

Therefore, 

T _l (tt)t = +TT , 

T _1 ^75^T = - TU/s T , 

= -T^^T , 

T" 1 (^7 /i 75^)T = - T^^^T . (40.41) 

Thus we see that TT is even under time reversal, while \Hi75\H, T7^T, and 
^ , 7 m 75 v I / are odd. 

For completeness we will also consider the transformation properties of 
bilinears under charge conjugation. Recall that 


C-^ixjC = CT t (x) , 


C~ 1 ~^{x)C = T T (x)C . 

(40.42) 

The bilinear TAT therefore transforms as 


c , “ 1 (tat)c = T t CACT t . 

(40.43) 
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Since all indices are contracted, we can rewrite the right-hand side as its 
transpose, with an extra minus sign for exchanging the order of the two 
fermion fields. We get 


C^^A^C = -+C+4 T C T + . 

(40.44) 

Recalling that C T = C _1 = —C, we have 


C'" 1 (+,4+)C' = + (c _1 M T c) + . 

(40.45) 

Once again we can go through the list: 


C~ 1 l T C = +1 , 

C^ 1 (*75) t C = +*75 , 

c-\ryc = -r , 

C^ 1 (7^75) t C = +7 m 75 • 

(40.46) 

Therefore, 


C7 _1 (++)(7 = + ++ , 


<7 _1 (+i7 5 +)c' = ++*75+ , 
C'" 1 (+7 /i +)c = — +7^+ , 

C i_ 1 ^+7 m 75+^C i = ++7^75+ . 

(40.47) 

Thus we see that ++, +*75+. and +7^75+ are even 

under charge conju- 


gation, while +7^ + i s odd. 

For a Majorana field, we have C'~ 1 'PC = + and C'~ 1 'PC I = +; this 
implies C ,_1 (+ J 4+)C' = +T+ for any combination of gamma matrices A. 
Since eq. (40.47) tells that C l_1 (+7 /i +)C' = — vpy j vp for either a Dirac or 
Majorana field, it must be that +7^+ = 0 for a Majorana field. 

Let us consider the combined effects of the three transformations (C, 
P, and T) on the bilinears. From eqs. (40.37), (40.41), and (40.47), we have 

(CPT)- 1 (VV')CPT = + ++ , 

(CPT)- 1 (+i 75 +) CPT = + +i 75 + , 
(CPT)- 1 (l?'fy')CPT = — +7+ + , 

{CPT)" 1 (+7^75+) CPT = - +7^75 + , (40.48) 

where we have used V^ v T v p = —5^ p . We see that ++ and 'Pi 75 4* are both 
even under CPT, while +7^+ and +7^75+ are both odd. These are (it 
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turns out) examples of a more general rule: a fermion bilinear with n vector 
indices (and no uncontracted spinor indices) is even (odd) under CPT if n 
is even (odd). This also applies if we allow derivatives acting on the fields, 
since each component of d ^ is odd under the combination PT and even 
under C . 

For scalar and vector fields, it is always possible to choose the phase 
factors in the C , P, and T transformations so that, overall, they obey the 
same rule: a hermitian combination of fields and derivatives is even or odd 
depending on the total number of uncontracted vector indices. Putting this 
together with our result for fermion bilinears, we see that any hermitian 
combination of any set of fields (scalar, vector, Dirac, Majorana) and their 
derivatives that is a Lorentz scalar (and so carries no indices) is even under 
CPT. Since the lagrangian must be formed out of such combinations, we 
have C(x) —> C(—x) under CPT, and so the action S = f d 4 x £ is invariant. 
This is the CPT theorem. 


Reference Notes 

A detailed treatment of CPT for fields of any spin is given in Weinberg I. 

Problems 

40.1) Find the transformation properties of T T and iS 1 ' 11 under 
P, T, and C. Verify that they are both even under CPT, as claimed. 
Do either or both vanish if 'I' is a Majorana held? 
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41 LSZ Reduction for Spin-One-Half 

Particles 

Prerequisite: 5, 39 

Let us now consider how to construct appropriate initial and final states 
for scattering experiments. We will first consider the case of a Dirac field 
T, and assume that its interactions respect the U(l) symmetry that gives 
rise to the conserved current j tl = and its associated charge Q. 

In the free theory, we can create a state of one particle by acting on the 


vacuum state with a creation operator: 

\P,S,+) = «4(p)|0> , (41.1) 

L,S,-) = 4(p)l°) > ( 41 -2) 

where the label ± on the ket indicates the value of the U(l) charge Q , and 

bt(p) = I d 3 x e^ObVte) , (41-3) 

4(p) = J d 3 xe ipx v s ( p) 7 °^(x) . (41.4) 

Recall that &J(p) and d\( p) are time independent in the free theory. The 
states | p, s, ±) have the Lorentz-invariant normalization 

( V , s, q\p, s', q) = (2? r) 3 <5 3 (p - p') S ss > 5 qq > , (41.5) 

where uj = (p 2 + m 2 ) 1 / 2 . 


Let us consider an operator that (in the free theory) creates a particle 
with definite spin and charge, localized in momentum space near pi, and 
localized in position space near the origin: 

b\ = j d 3 p / 1 (p)6j 1 (p) , (41.6) 

where 

fi(p) oc exp[— (p - P i) 2 /4 a 2 ] (41.7) 

is an appropriate wave packet, and a is its width in momentum space. If 
we time evolve (in the Schrodinger picture) the state created by this time- 
independent operator, then the wave packet will propagate (and spread 
out). The particle will thus be localized far from the origin as t — > Too. If 
we consider instead an initial state of the form |i) = b\ b\ \ 0) , where pi / P 2 , 
then we have two particles that are widely separated in the far past. 

Let us guess that this still works in the interacting theory. One compli- 
cation is that b\( p) will no longer be time independent, and so b\, eq. (41.6), 



41: LSZ Reduction for Spin-One-Half Particles 


264 


becomes time dependent as well. Our guess for a suitable initial state for 
a scattering experiment is then 

|i) = lirn b\(t)bl(t) |0) . (41.8) 

t — > — OO 


By appropriately normalizing the wave packets, we can make (i\i) = 1, and 
we will assume that this is the case. Similarly, we can consider a final state 

I/) = t Jj^ & i'(*) & 2'(*)l°} > ( 4L9 ) 

where pj / p' 2 , and (/|/) = 1. This describes two widely separated par- 
ticles in the far future. (We could also consider acting with more creation 
operators, if we are interested in the production of some extra particles in 
the collision of two, or using d' operators instead of b t operators for some 
or all of the initial and final particles.) Now the scattering amplitude is 
simply given by (f\i). 

We need to find a more useful expression for (f\i). To this end, let us 
note that 


b\ 


(-oo) 


&{(+oo) 



dt dob\ (t) 


~ J d 3 ph(p ) J A 9 o(e^$(x) 7 °« Sl (p)) • 

- J d 3 p fi (p) J d 4 x T (x) (7 0 d 0 - i7°p°) u Sl ( p)e ipx 

- j Sp f\ (p) J d 4 x^(x)(j°d 0 -iYpi~irnju sl (p)e tpx 
~ J d 3 p fi (p) J d 4 x T (x) (7 0 d 0 - 7* <9; - irnj u si ( p)e ipx 

- J d 3 p fi (p) J d 4 x T (x) (7 0 d 0 + 7 * <9; - irnj u si ( p)e ipx 

i J d :i p f\ (p) J d 4 x 4 >(x)(+i^ + m)u si (p)e ipx . (41.10) 


The first equality is just the fundamental theorem of calculus. To get the 
second, we substituted the definition of b\(t), and combined the d 3 x from 
this definition with the dt to get d 4 x. The third comes from straightforward 
evaluation of the time derivatives. The fourth uses (j/> + m)u s ( p) = 0. The 
fifth writes ipi as <9* acting on e ipx . The sixth uses integration by parts to 
move the <9* onto the field T (x ) ; here the wave packet is needed to avoid a 
surface term. The seventh simply identifies 7 °< 9 o + 7 l di as (j). 

In free-held theory, the right-hand side of eq. (41.10) is zero, since T(x) 
obeys the Dirac equation, which, after barring it, reads 


\H(x)(-| -ilj) + m) = 0 . 


(41.11) 
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In an interacting theory, however, the right-hand side of eq. (41.10) will not 
be zero. 

We will also need the hermitian conjugate of eq. (41.10), which (after 
some slight rearranging) reads 

i>i(+oo) - i>i(— oo) 

= i J d 3 p f\ (p) J d 4 x e~ ipx u si (p)(— itj) + m)'L(x) , (41.12) 

and the analogous formulae for the d operators, 
d\(—oo) — d{(+oo) 

= — i J d 3 p f\( p) J d 4 x e zpx v Sl (p)(— iff) + m)'L(x) , (41.13) 

di(+oo) — d\(— oo) 

= i J d 3 p fi( p) J d 4 x T(x)(-H# + m)v si (p)e~ ipx . (41.14) 

Let us now return to the scattering amplitude we were considering, 

if\i) = (0|6 2 '(+oo)6i/(+oo)b]((-oo)6|(-oo)|0) . (41.15) 

Note that the operators are in time order. Thus, if we feel like it, we can 
put in a time- ordering symbol without changing anything: 

(f\i) = (0| T62'(+ oo )^i , (+ oo )^i( _oo )4( _oo )|0) • (41.16) 

The symbol T means the product of operators to its right is to be ordered, 
not as written, but with operators at later times to the left of those at 
earlier times. However, there is an extra minus sign if this rearrangement 
involves an odd number of exchanges of these anticommuting operators. 

Now let us use eqs. (41.10) and (41.12) in eq. (41.16). The time-ordering 
symbol automatically moves all 6j/(— oo)’s to the right, where they anni- 
hilate |0). Similarly, all 6,}(+oo)’s move to the left, where they annihilate 

<°l- 

The wave packets no longer play a key role, and we can take the a — > 0 
limit in eq. (41.7), so that /i(p) = 5 3 (p — Pi). The initial and final states 
now have a delta-function normalization, the multiparticle generalization of 
eq. (41.5). We are left with the Lehmann- Symanzik-Zimmermann reduction 
formula for spin-one-half particles, 

(/ 1 *) = % 4 j d 4 X 1 d 4 x 2 d 4 X I' d 4 x 2 > 

x [u Si ,(p 1 /)(-i^ 1 , +m)\ ai , 
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x e [u Sa ,(p 2 ')(-*0 2 / + m)]a 2 , 

X (0| T ^a 2 ! {X2')^ ai , (xiO^ai (^l)^a 2 ( x 2)|0) 


X [{+i $ i + m)u Sl (pi)] Ql e ipi:ri 

X [(+*^2 + rn)u S2 (p 2 )\a 2 e ip2X2 . (41.17) 

The generalization of the LSZ formula to other processes should be clear; 
insert a time-ordering symbol, and make the following replacements: 

i4(p)in — ► +* /A 4t(x)(+i<j) + m)it s (p) e + * pa: , (41.18) 

& s (p)out ->• +* J d A x e~ ipx u s (p)(-i<jD + m)'F(x) , (41.19) 

<4(p)in — >• — * J d A x e +tpx v s (p)(— i<jD + m)'4>(x) , (41.20) 

4(p)out ->• ~i J d A x ' L(x)(-M $ + m)v 8 { p) e~ ipx , (41.21) 

where we have used the subscripts “in” and “out” to denote t — oo and 
t — > +oo, respectively. 


All of this holds for a Majorana field as well. In that case, G? s (p) = 6 s (p), 
and we can use either eq. (41.18) or eq. (41.20) for the incoming particles, 
and either eq. (41.19) or eq. (41.21) for the outgoing particles, whichever is 
more convenient. The Majorana condition T = T T C guarantees that the 
results will be equivalent. 

As in the case of a scalar field, we cheated a little in our derivation 
of the LSZ formula, because we assumed that the creation operators of 
free field theory would work comparably in the interacting theory. After 
performing an analysis that is entirely analogous to what we did for the 
scalar in section 5, we come to the same conclusion: the LSZ formula holds 
provided the field is properly normalized. For a Dirac held, we must require 


(0 'L(x) 0) = 0 , 

(41.22) 

(p,s,+|T(x)|0) = 0 , 

(41.23) 

(p,s,-Mx)\0) = v s (p)e~ ipx , 

(41.24) 

(p,s,+|T(x)|0) = u s (p)e~ ipx , 

(41.25) 

(p,s,-|T(x)|0) = 0 , 

(41.26) 


where (0|0) = 1, and the one-particle states are normalized according to 
eq. (41.5). 
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The zeros on the right-hand sides of eqs. (41.23) and (41.26) are required 
by charge conservation. To see this, start with [T(x), Q\ = +4'(x), take the 
matrix elements indicated, and use Q|0) = 0 and Q\p , s, ±) = ±| p, s, ±). 

The zero on the right-hand side of eq. (41.22) is required by Lorentz 
invariance. To see this, start with [4/(0), M pu \ = S^T(O), and take the 
expectation value in the vacuum state |0). If 1 0) is Lorentz invariant (as we 
will assume), then it is annihilated by the Lorentz generators M /i!y , which 
means that we must have 5 Mi/ (0|4'(0)|0) = 0; this is possible for all p and 
v only if (0|T(0)|0) = 0, which (by translation invariance) is possible only 
if (0|4 /(,t)|0) = 0. 

The right-hand sides of eqs. (41.24) and (41.25) are similarly fixed by 
Lorentz invariance: only the overall scale might be different in an interact- 
ing theory. However, the LSZ formula is correctly normalized if and only if 
eqs. (41.24) and (41.25) hold as written. We will enforce this by rescaling 
(or, one might say, renormalizing ) 4/(.x) by an overall constant. This is 
just a change of the name of the operator of interest, and does not affect 
the physics. However, the rescaled 'L(.t) will obey eqs. (41.24) and (41.25). 
(These two equations are related by charge conjugation, and so actually 
constitute only one condition on 4/.) 

For a Majorana field, there is no conserved charge, and we have 

<0|4/(.t)|0) = 0 , (41.27) 

{p, s|4/(x)|0) = v s (p)e~ wx , (41.28) 

{p, s|’F(:c)|0) = u s (p)e~ wx , (41.29) 

instead of eqs. (41.22-41.26). 

The renormalization of 4^ necessitates including appropriate Z factors 
in the lagrangian. Consider, for example, 

C = - Z m mW - lZ g g{^4>) 2 , (41.30) 

where T is a Dirac field, and g is a coupling constant. We choose the three 
constants Z, Z m , and Z g so that the following three conditions are satisfied: 
m is the mass of a single particle; g is fixed by some appropriate scattering 
cross section; and eq. (41.24) and is obeyed. [Eq. (41.25) then follows by 
charge conjugation.] 

Next, we must develop the tools needed to compute the correlation 
functions (0|T4' Q , 1 , (xy) . . . T^xi) . . . |0) in an interacting quantum field 
theory. 

Problems 

41.1) Assuming that eq. (39.40) holds for the exact single-particle states, 
verify eqs. (41.23) and (41.26), up to overall scale. 
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42 The Free Fermion Propagator 

Prerequisite: 39 

Consider a free Dirac field 

*(*) = £ [dp \b s (p)u s (p)e ipx + d\(p)v s (p)e~ ipx \ , (42.1) 

s=± J 

®(y) = f d P' bl'{p)u s '{p)e~ ip ' y + d s ’(p , )v s '(p , )e ip ' y , (42.2) 

S'=± J 

where 

Mp)I°> = 4(p)|0) = 0 , (42.3) 

and 

{& s (p),&Up')} = (27r) 3 T(p - p')2iv5 ss > , (42.4) 

{4(p),4'(p)} = (27r) 3 (5 3 (p — p ; ) 2uj8 ss i , (42.5) 

and all the other possible anticommutators between b and d operators (and 
their hermitian conjugates) vanish. 

We wish to compute the Feynman propagator 

S(x - y) a p = *(0|T^ a (x)’F s (y)|0) , (42.6) 

where T denotes the time-ordered product, 

Ttf a (s)^g(y) = 9(x° - y°)^ a (x)^ p (y) - 6(y° - x°)V p (y)V a (x) , (42.7) 

and 0(t) is the unit step function. Note the minus sign in the second term; 
this is needed because 'F Q ,(x) 1 I' / g(y) = — ^>y(y)^/ a (x) when x° / y°. 

We can now compute (0|'h Q ,(x)'I'Ty)|0) and (0|'P ( g(y)'I , a (:r)|0) by in- 
serting eqs. (42.1) and (42.2), and then using eqs. (42.3-42.5). We get 

(0|4^(x)'I> / 3( 2 /)|0) 

= Y1 j dpdp' e ipx e~ w ' y u s (p) a u s ’(p')p (0|6 s (p)6^,(p , )|0) 

s,s' 

= J dp dp' e ipx e~ tply u s ( p) a Us'{p')p {2n) 3 5 3 (p - p r ) 2 u6 ss , 

S,S / 

= J2 ! dp e vp(x - y) u s {p) a u s (p)(i 

s J 

= f dP e ip[x ~ v) 


(-i^ + m)a/3 ■ 


(42.8) 
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To get the last line, we used a result from section 38. Similarly, 
(0|^(y)^ Q (z)|0) 

= E J dp dp' e~ ipx e w ' y v s (p) a v s f(p')p (0|dy (p')4(p)l°) 

s,s' 

= E J dpdp' e ~ ipx eW ' y v s(p)aV s '(p')p (27r) 3 5 3 (p - p') 2 u5 ss > 

s,s' 

= E/ dpe- ip{x - y K s (p) a v s (p) p 

= [dp e - ip{x ~ y) - m) a p . (42.9) 


We can combine eqs. (42.8) and (42.9) into a compact formula for the time- 
ordered product by means of the identity 


d*p M x -rtf(p) 

(27 r) 4 p 2 + m 2 — ie 


= id(x°—y°) / dpe 1 - 


+ id{y°-x 0 ) J dpe~ ip{x - y) f(-p) , (42.10) 



(42.14) 



42: The Free Fermion Propagator 


270 


We can also consider (0|T^' Q ,(x)'I / ( g(y)|0) and (0|T^ r a (a;)^ , / g(y)|0), but it is 
easy to see that now there is no way to pair up a b with a or a d with a 
d\ and so 


(0|T* a (x)^(y)|0> = 0 , (42.15) 

(0|T^ a (z)^(j/)|0) = 0 . (42.16) 


Next, consider a Major ana field 


T(x) 

*(y) 


E d P [b s {v)u s {v)e ipx + b\{ V )v s (p)e~ ipx 

s=± J 

E [ d ‘P' \b\,{v)u s '{p')e~ w ' y + b s f(p)v s f(p)e ip ' y 


s'=±' 


(42.17) 

(42.18) 


It is easy to see that (0|T4 , Q ,(x)4' / 3(?/)|0) is the same as it is in the Dirac 
case; the only difference in the calculation is that we would have b and b t in 
place of d and d t in the second line of eq. (42.9), and this does not change 
the final result. Thus, 


*(0|T^ a (x)^8(y)|0) = S( x - y) a0 , (42.19) 

where S(x — y) is given by eq. (42.12). 

However, eqs. (42.15) and (42.16) no longer hold for a Majorana held. 
Instead, the Majorana condition T = T T C, which can be rewritten as 
4' T = TC -1 , implies 

i(0|TT Q (x)4' ( g(y)|0) = i(0|T^ a (s)W 7 (y)|0)(C- 1 ) 7/3 

= [S(x - y)C~ l ] a0 . (42.20) 

Similarly, using C T = C _1 , we can write the Majorana condition as T t = 
C _1 T, and so 

z(0|T^ a (x)^(y)|0) = i(C _1 ) a7 (0|T^' 7 (x)l' / 3(y)|0) 

= [C- X 5(x - y)\ a0 . (42.21) 

Of course, C^ 1 = —C, but it will prove more convenient to leave eqs. (42.20) 
and (42.21) as they are. 

We can also consider the vacuum expectation value of a time-ordered 
product of more than two fields. In the Dirac case, we must have an equal 
number of 4/’s and T’s to get a nonzero result; and then, the T’s and T’s 
must pair up to form propagators. There is an extra minus sign if the 
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ordering of the fields in their pairs is an odd permutation of the original 
ordering. For example, 

?: 2 (0|Td' a (x)^' / 3(7/)^ 7 (2;)^ ( 5(u;)|0) = + S(x - y) a/3 S(z - w) lS 

- S(x- w) a5 S(z — y) 7/3 . (42.22) 

In the Majorana case, we may as well let all the fields be T’s (since we can 
always replace a \E f with 4 /T C). Then we must pair them up in all possible 
ways. There is an extra minus sign if the ordering of the fields in their pairs 
is an odd permutation of the original ordering. For example, 

* 2 ( 0 |T^ a (x)^(y)^ 7 (z)^ 5 (-u;)| 0 ) = + [S (x - y)^ 1 }^ [S (z - w)C~% s 

- [S(x - z)C~ l } ai [S(y - w)C~ l ]p 8 

+ [^(.t - u>)C _1 ] Q(5 [S(y - z)C~ l ]/ 3 y . 

(42.23) 

Note that the ordering within a pair does not matter, since 

[S(x - y)C _1 ] Q , i a = -[S(y - x)C~ l ]p a . (42.24) 

This follows from anticommutation of the fields and eq. (42.20). 

Problems 


42.1) Prove eq. (42.24) directly, using properties of the C matrix. 
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43 The Path Integral for Fermion Fields 

Prerequisite: 9, 42 

We would like to write down a path integral formula for the vacuum- 
expectation value of a time-ordered product of free Dirac or Majorana fields. 
Recall that for a real scalar field with 


Co = -5<9 / V<9 /t </? - \m 2 y 2 

= -\tp(-d 2 + m 2 )p - ^d^ipd^ip) 


we have 


where 


< 0 ' T ^')-' 0 ) = 7I/b"' Zo(J) 


J = 0 


Z 0 (J) = j Vip exp i J d 4 x (Cq + Jg>) 


(43.1) 

(43.2) 

(43.3) 


In this formula, we use the epsilon trick (see section 6) of replacing m 2 with 
m 2 — ie to construct the vacuum as the initial and final state. Then we get 


Zq{J) = exp 


l - j d A x d 4 y J (x)A(x - y)J{y) 


where the Feynman propagator 


A(x -y) = j 


d 4 k pK x -v) 


(27t) 4 k 2 + m 2 — ie 
is the inverse of the Klein-Gordon wave operator: 

(— d 2 + m 2 ) A(x — y) = 5 4 (x — y ) . 
For a complex scalar field with 
Co = 

= -^t(_ d 2 + m 2 )y - d^d^v) 

we have instead 


(43.4) 


(43.5) 


(43.6) 


(0|T<^(xi) . . . g>\y\) • • • |0) = - 


i 5Jl(x i) % 5J(yi) 


...Zo(J\J) 


(43.7) 


j=jt=o ’ 
(43.8) 


where 


Zo{J\ J) = jv^Vip exp i J d A x ( Co + ip + J) 


= exp 


i / d 4 x d A y j\x) A(x — y) J (y) 


(43.9) 
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We treat J and jt as independent variables when evaluating eq. (43.8). 

In the case of a fermion field, we should have something similar, except 
that we need to account for the extra minus signs from anticommutation. 
For this to work out, a functional derivative with respect to an anticom- 
muting variable must itself be treated as anticommuting. Thus if we define 
an anticommuting source i](x) for a Dirac field, we can write 

77-T / d 4 y \v(y)^(y) + v(y)ri(yj\ = -^(*) , (43.10) 

OT](X) J L J 

fir 

J dSj rj{y)H(y) + F(y)r](y) = +T(x) . (43.11) 

The minus sign in eq. (43.10) arises because the 8 / 81 7 must pass through T 
before reaching 77 . 

Thus, consider a free Dirac field with 

£ 0 = - mT'F 

= — \|/(— iff) + m)\k . (43.12) 

A natural guess for the appropriate path-integral formula, based on analogy 
with eq. (43.9), is 


where 


(0|T^ ai (x’i) . . . (y\) . • • |0) 

- 1 6 6 

* i(*l) %i(?/l) 


Zo(y,y) 


11=11=0 ’ 


(43.13) 



(43.14) 


(43.15) 


(43.16) 


Note that each 8 j Sr/ in eq. (43.13) comes with a factor of i rather than the 
usual 1/i; this reflects the extra minus sign of eq. (43.10). We treat i] and 77 
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as independent variables when evaluating eq. (43.13). It is straightforward 
to check (by working out a few examples) that eqs. (43.13-43.16) do indeed 
reproduce the result of section 42 for the vacuum expectation value of a 
time-ordered product of Dirac fields. 

This is really all we need to know. Recall that, for a complex scalar 
field with interactions specified by Ci(ip',<p), we have 

« exp [i J A Z 0 (.r,J) . (43.17) 

where the overall normalization is fixed by Z(0, 0) = 1. Thus, for a Dirac 
field with interactions specified by £i(’F, T), we have 

oc exp M - n) _ (4348) 

where again the overall normalization is fixed by Z( 0,0) = 1. Vacuum 
expectation values of time-ordered products of Dirac fields in an interact- 
ing theory will now be given by eq. (43.13), but with Zo(rj,y) replaced by 
Z(j ], rf). Then, just as for a scalar field, this will lead to a Feynman-diagram 
expansion for Z(r/, ?y). There are two extra complications: we must keep 
track of the spinor indices, and we must keep track of the extra minus signs 
from anticommutation. Both tasks are straightforward; we will take them 
up in section 45. 

Next, let us consider a Majorana field with 

Co = - ±toT t CT 

= -±T T C(-^ + rn)F . (43.19) 

A natural guess for the appropriate path-integral formula, based on analogy 
with eq. (43.2), is 

(0|T 4 .«(x 1 )..,|0) = l-^ ...ZoM^, (43.20) 

where 

Zo(f?) = j 'D'F exp i J d A x (Co + rf 'F) 

= exp — ^ J d\d ir y r) T (x)S(x — y)C~ 1 rj(y) . (43.21) 

The Feynman propagator S(x — y)C -1 is the inverse of the Majorana wave 
operator C(—i<j) + to): 

C(—i<f) x + m)S(x — y)C _1 = <J 4 (x — y) . 


(43.22) 
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The extra minus sign in eq. (43.21), as compared with eq. (43.14), arises 
because all functional derivatives in eq. (43.20) are accompanied by 1/i, 
rather than half by 1/i and half by i, as in eq. (43.13). It is now straight- 
forward to check (by working out a few examples) that eqs. (43. 20-43. 22) 
do indeed reproduce the result of section 42 for the vacuum expectation 
value of a time-ordered product of Majorana fields. 
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44 Formal Development of Fermionic Path 

Integrals 

Prerequisite: 43 


In 43, we formally defined the fermionic path integral for a free Dirac field 
via 


Z 0 (rj, rj) = / VFVF exp 


i I d A xF(i(f) — m)F + ijF + Frj 


= exp 


i I d 4 x d A y rj(x)S(x — y)r](y) 


(44.1) 


where the Feynman propagator S(x — y) is the inverse of the Dirac wave 
operator: 

{—iftx + m)S(x — y) = 5 4 (x — y) . (44.2) 

We would like to find a mathematical framework that allows us to derive 
this formula, rather than postulating it by analogy. 

Consider a set of anticommuting numbers or Grassmann variables r tp t 
that obey 

bPiAj} = 0, (44.3) 

where i = 1 , ,n. Let us begin with the very simplest case of n = 1, and 
thus a single anticommuting number ip that obeys % ; 2 = 0. We can define 
a function /(V’) of such an object via a Taylor expansion; because ip 2 = 0, 
this expansion ends with the second term: 

f(ip) = a + 'ipb . (44.4) 

The reason for writing the coefficient b to the right of the variable ip will 
become clear in a moment. 

Next we would like to define the derivative of f(tp) with respect to ip. 
Before we can do so, we must decide if f(ip) itself is to be commuting 
or anticommuting; generally we will be interested in functions that are 
themselves commuting. In this case, a in eq. (44.4) should be treated as an 
ordinary commuting number, but b should be treated as an anticommuting 
number: {6, 6} = {6, ip} = 0. In this case, f{ip) = a + ipb = a — bip. 

Now we can define two kinds of derivatives. The left derivative of f(ip) 
with respect to ip is given by the coefficient of ip when f{ip) is written with 
the ip always on the far left: 


<W(V>) = +b ■ 


(44.5) 
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Similarly, the right, derivative of /(VO with respect to ip is given by the 
coefficient of V’ when /(VO is written with the ip always on the far right: 

f(ip)d^ = -b. (44.6) 


Generally, when we write a derivative with respect to a Grassmann vari- 
able, we mean the left derivative. However, in section 37, when we wrote 
the canonical momentum for a fermionic field ip as ir = dC/d(doip), we 
actually meant the right derivative. (This is a standard, though rarely 
stated, convention.) Correspondingly, we wrote the hamiltonian density as 
hi = irdoip — C, with do ip to the right of n. 

Finally, we would like to define a definite integral, analogous to inte- 
grating a real variable x from minus to plus infinity. The key features of 
such an integral over x (when it converges) are linearity, 

/ -f-oo /* + oo 

dxcf{x) = c / dxf(x) , (44.7) 

-oo J — oo 


and invariance under shifts of the dependent variable x by a constant: 



dx f(x + a) = 


r+oo 


dx f(x) 


(44.8) 


Up to an overall numerical factor that is the same for every f(ip), the 
only possible nontrivial definition of / dip f{ip) that is both linear and shift 
invariant is 

J dip f (ip) = b . (44.9) 

Now let us generalize this to n > 1. We have 


/(VO = a + ipA + ^ipnipi 2 Ci li2 + . . . + ^ipi i . . . ipi n d il ...i n , (44.10) 


where the indices are implicitly summed. Here we have written the coef- 
ficients to the right of the variables to facilitate left-differentiation. These 
coefficients are completely antisymmetric on exchange of any two indices. 
The left derivative of f(ip) with respect to ipj is 

4“ /WO = b i + & c 3i + • • • + FWi)T^2 • • • Vi,djh..,i„ ■ (44.11) 

Next we would like to find a linear, shift-invariant definition of the 
integral of f(ip). Note that the antisymmetry of the coefficients implies 
that 

di i...i n = d £{ . (44.12) 

where d is a just a number (ordinary if / is commuting and n is even, Grass- 
mann if / is commuting and n is odd, etc.), and is the completely 
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antisymmetric Levi-Civita symbol with £\___ n = +1. This number d is a 
candidate (in fact, up to an overall numerical factor, the only candidate!) 
for the integral of f(ip): 

Jd n ^fW = d. (44.13) 

Although eq. (44.13) really tells us everything we need to know about / cffy, 
we can, if we like, write d'hp = <f0n • • • dipi (note the backwards ordering), 
and treat the individual differentials as anticommuting: {dipi, dipj} = 0, 
{di/>i,ipj} = 0. Then we take / dipi = 0 and / cTi/iVl? = as our basic 
formulae, and use them to derive eq. (44.13). 

Let us work out some consequences of eq. (44.13). Consider what hap- 
pens if we make a linear change of variable, 

A = Jirfj , (44.14) 

where Jj\ is a matrix of commuting numbers (and therefore can be written 
on either the left or right of ?// ). We now have 

/(VO = a + . . . + jjfWiu'i V/J • • • (Jinjn^'j n )£ii-ind ■ (44.15) 

Next we use 

£ ii...i n Jiiji ■ • • di n j n = (det J)£j 1 ...j n , (44.16) 

which holds for any n x n matrix J, to get 

/W) = a + ... + ^ , il ...^ , in £ii...i n {detJ)d. (44.17) 

If we now integrate /(VO over , eq. (44.13) tells us that the result is 
(det J)d. Thus, 

J d n ip /(VO = (det J)” 1 J cfV /(VO • (44.18) 

Recall that, for integrals over commuting real numbers x* with X{ = JijXj , 
we have instead 

J d n x f(x) = (det J) +1 J d n x f(x) . (44.19) 

Note the opposite sign on the power of the determinant. 

Now consider a quadratic form V’ T A/V’ = V’i-^jVh'j where M is an anti- 
symmetric matrix of commuting numbers (possibly complex). Let’s evalu- 
ate the gaussian integral / d^tfj exp For example, for n = 2, we 
have 

( 0 +m \ 

M = , (44.20) 

V — m 0 J 
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and ip T MiJ) = 2raV’iV ; 2- Thus exp(^ T M^) = 1 + mV’iV’ 2 , and so 


J d n il) exp (i^ T M'i/’) = m . 


(44.21) 


For larger n, we use the fact that a complex antisymmetric matrix can be 
brought to a block-diagonal form via 


U T MU = 


( 0 +m\ 


-m i 


■J 


(44.22) 


where U is a unitary matrix, and each ?b/ is real and positive. (If n is odd 
there is a final row and column of all zeroes; from here on, we assume n is 
even.) We can now let V’i = then, we have 


cffy exp = (det U) 


-1 


n/2 

n 

i=i - 


cPi/ij exp (|'0 T A//V ; ) > (44.23) 


where Mi represents one of the 2x2 blocks in eq. (44.22). Each of these 
two-dimensional integrals can be evaluated using eq. (44.21), and so 


r n / 2 

/ d n ili exp (|^ T M^) = (det U)~ L ]^[ mi . 

J i=i 

Taking the determinant of eq. (44.22), we get 

n/2 

(det U) 2 (det M) = mj . 

i=i 

We can therefore rewrite the right-hand side of eq. (44.24) as 


(44.24) 


(44.25) 


J cffy exp = (detM) 1 / 2 . 


(44.26) 


In this form, there is a sign ambiguity associated with the square root; it 
is resolved by eq. (44.24). However, the overall sign (more generally, any 
overall numerical factor) will never be of concern to us, so we can use 
eq. (44.26) without worrying about the correct branch of the square root. 

It is instructive to compare eq. (44.26) with the corresponding gaussian 
integral for commuting real numbers, 


d n x exp (—7}X t Mx) = (27r) n//2 (det M) 1 ^ 2 


(44.27) 
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Here M is a complex symmetric matrix. Again, note the opposite sign on 
the power of the determinant. 

Now let us introduce the notion of complex Grassmann variables via 


x = +#2) 

72 ( 


X = -ToCi/h -# 2 ) • 


We can invert this to get 


> 1 ' 

> 2 , 


1 

V2 


1 1 


l —l , 


X 

,x. 


The determinant of this transformation matrix is —i, and so 
c tfif = d^d'ipi = (~i)~ 1 dxdx ■ 

Also, V’i'02 = —ixX- Thus we have 

J dxdxxx = (H)(H ) -1 J dip 2 dip 1 ^ 1^2 = 1 • 

Thus, if we have a function 

fix , X) = a + xb + xc + XXd , 


its integral is 
In particular, 


J dxdxf{x,X ) = d . 


dxdx exp (mxx) = rn 


(44.28) 

(44.29) 

(44.30) 

(44.31) 

(44.32) 

(44.33) 

(44.34) 


Let us now consider n complex Grassmann variables x% aR d their com- 
plex conjugates, Xi- We define 

d n X d n X = dxndxn • • • ^Xi^Xi • (44.35) 

Then under a change of variable, Xi = dijXj an d Xi = KijXji we have 

d n X d n x = (det J)" 1 (det K)~ l d n x d n x • (44.36) 


Note that we need not require I\ij = J*j, because, as far as the integral is 
concerned, it is does not matter whether or not Xi is the complex conjugate 
of Xi- 

We now have enough information to evaluate / d n xd n x exp(x'M%), 
where M is a general complex matrix. We make the change of variable 
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X = Ux' and x} = x'^V ■ where U and V are unitary matrices with the 
property that VMU is diagonal with positive real entries rrg . Then we get 


d n xd n x exp(x t My) = (det U) 1 (detV) 1 / dxidxi exp(mjXiXj) 


i= 1 * 
n 


= (detC7) 1 (detF) 1 m; 

i= 1 


= det M . 


(44.37) 


This can be compared to the analogous integral for commuting complex 
variables Zi = ( Xi + iyi )/\/ 2 and z = (x* — iyi)/V 2, with d n zd n z = d n xd n y, 
namely 

J d n zd n z exp (-/Jllz) = (27r) n (det M) _1 . (44.38) 

We can now generalize eqs. (44.26) and (44.37) by shifting the integra- 
tion variables, and using shift invariance of the integrals. Thus, by making 
the replacement ^ ^ — M~ l y in eq. (44.26), we get 

J dy. ) exp(d') p T M'ip + if'ip) = (det Af) 1//2 exp(d?y T Af _1 ? 7 ) . (44.39) 

(In verifying this, remember that M and its inverse are both antisym- 
metric.) Similarly, by making the replacements X X ~ A7~ 1 r/ and 
X 1 —■ *■ ~ in eq. (44.37), we get 

/ dy dy expfxt MX + + xH = (det M) exp(-,t MS) . (44.40) 

We can now see that eq. (44.1) is simply a particular case of eq. (44.40), 
with the index on the complex Grassmann variable generalized to include 
both the ordinary spin index a and the continuous spacetime argument x 
of the field 4 , Q (a;). Similarly, eq. (43.21) for the path integral for a free 
Majorana field is simply a particular case of eq. (44.39). In both cases, the 
determinant factors are constants (that is, independent of the fields and 
sources) that we simply absorb into the overall normalization of the path 
integral. We will meet determinants that cannot be so neatly absorbed in 
sections 53 and 71. 
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45 The Feynman Rules for Dirac Fields 

Prerequisite: 10, 12, 41, 43 


In this section we will derive the Feynman rules for Yukawa theory , a theory 
with a Dirac field \F (with mass to) and a real scalar field <p (with mass 
M), interacting via 

Ci = gpFF , (45.1) 


where g is a coupling constant. In this section, we will be concerned with 
tree-level processes only, and so we omit renormalizing Z factors. 

In four spacetime dimensions, ip has mass dimension [ <p\ = 1 and \F has 
mass dimension [4/] = §; thus the coupling constant g is dimensionless: 
[g\ = 0. As discussed in section 12, this is generally the most interesting 
situation. 

Note that C\ is invariant under the U(l) transformation T — > e~’ ia '4 > , 
as is the free Dirac lagrangian. Thus, the corresponding Noether current 
'Fy^'F is still conserved, and the associated charge Q (which counts the 
number of 6-type particles minus the number of d-type particles) is constant 
in time. 

We can think of Q as electric charge, and identify the 6-type particle 
as the electron e~ , and the d-type particle as the positron e + . The scalar 
particle is electrically neutral (and could, for example, be thought of as the 
Higgs boson; see section 88). 

We now use the general result of sections 9 and 43 to write 


Z(r/. rj, J) oc exp 


R] 


d A x ( 


i 5J(x) 



\i 8r] a (x ) 


where 


z o(r , 
(45.2) 


and 


z o(r] ,V,J) = exp i ^ 
x exp 


<Tx dry rj(x)S(x — y)rj(y ) 
l - J d A xd 4 y J(x)A(x - y)J{y) , 


S{x 

A(x 


y) = j 

y) = J 


d^p (— ^ + m)e ip ( x ^ 
(27 r) 4 p 2 + m 2 — ie 

d 4 k 

(27t) 4 k 2 + M 2 - ie 


(45.3) 

(45.4) 

(45.5) 


are the appropriate Feynman propagators for the corresponding free fields. 
We impose the normalization Z(0,0, 0) = 1, and write 


Z (V,V , J ) = exp[iW(?7,?7, J)] . 


(45.6) 
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Figure 45.1: Tree contributions to iW(r/,r /, J) with four or fewer sources. 

Then iW ( rj , rj , J) can be expressed as a series of connected Feynman dia- 
grams with sources. 

We use a dashed line to stand for the scalar propagator jA(x — y), 
and a solid line to stand for the fermion propagator \S(x — y). The only 
allowed vertex joins two solid lines and one dashed line; the associated 
vertex factor is ig. The blob at the end of a dashed line stands for the <p 
source i J d 4 x J(x), and the blob at the end of a solid line for either the 4/ 
source i f d 4 xrj(x), or the T source i f d 4 xrj(x). To tell which is which, we 
adopt the “arrow rule” of problem 9.3: the blob stands for i f d 4 x r/(x) if the 
arrow on the attached line points away from the blob, and the blob stands 
for i J d 4 xrj(x) if the arrow on the attached line points towards the blob. 
Because C\ involves one 4/ and one T, we also have the rule that, at each 
vertex, one arrow must point towards the vertex, and one away. The first 
few tree diagrams that contribute to iW (rj, rj, J) are shown in fig. (45.1). We 
omit tadpole diagrams; as in y? 3 theory, these can be cancelled by shifting 
the (p field, or, equivalently, adding a term linear in ip to C. The LSZ 
formula is valid only after all tadpole diagrams have been cancelled in this 
way. 

The spin indices on the fermionic sources and propagators are all con- 
tracted in the obvious way. For example, the complete expression corre- 
sponding to fig. (45.1)(b) is 

Fig. (45.1)(b) = (ig) J d 4 x d 4 y d 4 z d 4 w 

x rj(x)S(x — y)S(y — z)g(z) 
xA (y-w)J(w). (45.7) 

Our main purpose in this section is to compute the tree-level amplitudes 
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Figure 45.2: Diagrams corresponding to eq. (45.8). 


for various two-body elastic scattering processes, such as e ~ — ► e~ ip and 
e + e _ — > (p(p-, for these, we will need to evaluate the tree-level contributions 
to connected correlation functions of the form (0|T’F\E , <^y>|0)o Other pro- 
cesses of interest include e _ e _ — > e _ e _ and e + e~ — > e + e _ ; for these, we 
will need to evaluate the tree-level contributions to connected correlation 
functions of the form (0|T4'4 , 'I'4'|0)c. 

For (0|T'F'I'(^</?|0)c, the relevant tree-level contribution to iW(r},rj, J) 
is given by fig. (45.1)(c). We have 


{0\T^ a {x)^ p(y)ip{zi)ip{z 2 )\f))c 


I - i - I - I - iWlrj ri J ) 

i 5rJ Q (x) 8rjp(y) i 6J(zi) i 6 J(z 2 ) 


rj=r]=J = 0 


= (i ) 5 {ig ) 2 J d 4 W! d A w 2 

x [S(x-wi)S(wi-w 2 )S(w 2 -y)] a p 
x A(zi-wi)A(z 2 —w 2 ) 

+ z-z) + Q(g 4 ) . (45.8) 


The corresponding diagrams, with sources removed, are shown in fig. (45.2). 

For (0|T\I'\E , \I , \E , |0)c, the relevant tree-level contribution to iW(rj,r), J) 
is given by fig. (45.1)(d), which has a symmetry factor S = 2. We have 


(0|TT Ql (xi)T a {yi)'f'a 2 {x 2 )^p 2 (?/2)|0)c 


15 5 15 .5 

7 l 

* dy ai (xi ) S-rifa (yi ) i 8rj a2 (x 2 ) Sr/p 2 (y 2 ) 


iW(r/,rj,J ) 


•q=T)=J = 0 

(45.9) 


The two i] derivatives can act on the two r/’s in the diagram in two different 
ways; ditto for the two rj derivatives. This results in four different terms, 
but two of them are algebraic duplicates of the other two; this duplication 
cancels the symmetry factor (which is a general result for tree diagrams). 
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>'i vv’i Xl y, x 2 

7 i 7 7 i 7 


> ’ > 


> ' > 


y 2 w 2 x 2 )>2 w 2 x \ 


Figure 45.3: Diagrams corresponding to eq. (45.10). 


We get 

(0|T'F ai (xi)^ ( yi )* a2 (x 2 )^ 2 ( 2 / 2 ) 1 0) c 

= (jf (dj ) 2 j d A wi d 4 iv 2 

x [S , (xi-u;i)S , (r(;i-yi)] Q;i/ 3 1 
x A(uq — w 2 ) 

x [S(x 2 -w 2 )S(w 2 -y 2 )\ a2 p 2 

~ ((yi,A) <-> ( 2 / 2 , /? 2 )) +0(g 4 ) . (45.10) 

The corresponding diagrams, with sources removed, are shown in fig. (45.3). 
Note that we now have a relative minus sign between the two diagrams, 
due to the anticommutation of the derivatives with respect to rj. 

In general, the overall sign for a diagram can be determined by the 
following procedure. First, draw each diagram with all the fermion lines 
horizontal, with their arrows pointing from left to right, and with the left 
endpoints labeled in the same fixed order (from top to bottom). Next, in 
each diagram, note the ordering (from top to bottom) of the labels on the 
right endpoints of the fermion lines. If this ordering is an even permutation 
of an arbitrarily chosen fixed ordering, then the sign of that diagram is 
positive, and if it is an odd permutation, the sign is negative. (This rule 
arises because endpoints with arrows pointing away from the vertex come 
from derivatives with respect to rj that anticommute. Of course, we could 
equally well put the right endpoints in a fixed order, and get the sign from 
the permutation of the left endpoints, which come from derivatives with 
respect to g that anticommute.) Also, in loop diagrams, a closed fermion 
loop yields an extra minus sign; we will discuss this rule in section 51. 

Let us now consider a particular scattering process: e~<p — ► e~<p. The 
scattering amplitude is 

(f\i) = (0| T a(k / )out&s'(p / )out&i(p)inO t (k)i 11 10) . (45.11) 

Next we make the replacements 

^(P)in -> i J d 4 y '&(y)(+i@ + m)u s ( p)e +ipy , (45.12) 
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p p+k p ' p p-k' p / 

> T > T > > T > T > 

k A \Zk' + k ' V A& 

II II 


Figure 45.4: Diagrams for e — > e 99 , corresponding to eq. (45.16). 


, s l (P )out 

- 9 - i j d 4 x e ip ' x u s i(p')(— ify + m)’F(x) , 

(45.13) 

^(k)^ - 

■* i J d 4 z 1 e +ikzi {-d 2 + m 2 )cp{z 1 ) , 

(45.14) 

a( k/ )out - 

i J d 4 z 2 e~ ik ' Z2 (-d 2 + m 2 )cp(z 2 ) 

(45.15) 


in eq. (45.11), and then use eq. (45.8). The wave operators (either Klein- 
Gordon or Dirac) act on the external propagators, and convert them to 
delta functions. After using eqs. (45.4) and (45.5) for the internal propa- 
gators, all dependence on the various spacetime coordinates is in the form 
of plane-wave factors, as in section 10. Integrating over the internal co- 
ordinates then generates delta functions that conserve four-momentum at 
each vertex. The only new feature arises from the spinor factors u s (p) and 
rv(p'). We find that u s (p) is associated with the external fermion line 
whose arrow points towards the vertex, and that Us'(p') is associated with 
the external fermion line whose arrow points away from the vertex. We can 
therefore draw the momentum-space diagrams of fig. (45.4). Since there is 
only one fermion line in each diagram, the relative sign is positive. The 
tree- level e~ p — > e~ ip scattering amplitude is then given by 

- lj + m + ty' + 

—s + m 2 —u + m 

where s = — (p+k) 2 and u = —(p—k 1 ) 2 . (We can safely ignore the ids in the 
propagators, because their denominators cannot vanish for any physically 
allowed values of s and u.) 

Next consider the process e + (p — > e + p. We now have 

(f\i) = (0| T a(k , )out4'(p / )out4(p)in« t ( k )in |0> . (45.17) 

The relevant replacements are 


iT e -<p-+ e - v = u s '{p') 


m 


u s { P) 


(45.16) 


d A x e +ipx v s (p + m)’F(x) , 


(45.18) 
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+ 
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Figure 45.5: Diagrams for — ► e + <p, corresponding to eq. (45.22). 


(s' (p )out 

-i J d A y T(y)(+i^ +m)v s >{ p ; ) e ipy , 

(45.19) 

O t (k)i I1 - 

■* i J d 4 z i e +ikzi (-d 1 2 + m 2 )p(z i) , 

(45.20) 

®(k )out - 

■* i [ d 4 z 2 e~ ikZ2 (-d 2 + m 2 )p(z 2 ) . 

(45.21) 


We substitute these into eq. (45.17), and then use eq. (45.8). This ulti- 
mately leads to the momentum-space Feynman diagrams of fig. (45.5). Note 
that we must now label the external fermion lines with minus their four- 
momenta; this is characteristic of d-type particles. (The same phenomenon 
occurs for a complex scalar field; see problem 10.2.) Regarding the spinor 
factors, we find that — u s (p) is associated with the external fermion line 
whose arrow points away from the vertex, and —v s i (p ; ) with the external 
fermion line whose arrow points towards the vertex. The minus signs at- 
tached to each v and v can be consistently dropped, however, as they only 
affect the overall sign of the amplitude (and not the relative signs among 
contributing diagrams). The tree-level expression for the e + <p — > e + ip am- 
plitude is then 

l) — lj' + m + 1£ + m 
—u + m 2 — s + m 2 


) , (45.22) 


iTe+V^e+v = j(ig) 2 V s { P) 


where again s = — (p + k) 2 and u = —(p — k') 2 . 

After working out a few more of these (you might try your hand at 
some of them before reading ahead), we can abstract the following set of 
Feynman rules. 


1. For each incoming electron, draw a solid line with an arrow pointed 
towards the vertex, and label it with the electron’s four-momentum, 
Pi- 

2. For each outgoing electron, draw a solid line with an arrow pointed 
away from the vertex, and label it with the electron’s four-momentum, 

Pi- 
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3. For each incoming positron, draw a solid line with an arrow pointed 
away from the vertex, and label it with minus the positron’s four- 
momentum, —pi . 

4. For each outgoing positron, draw a solid line with an arrow pointed 
towards the vertex, and label it with minus the positron’s four-momentum, 



5. For each incoming scalar, draw a dashed line with an arrow pointed 
towards the vertex, and label it with the scalar’s four- momentum, k t . 


6. For each outgoing scalar, draw a dashed line with an arrow pointed 
away from the vertex, and label it with the scalar’s four-momentum, 

K ' 

7. The only allowed vertex joins two solid lines, one with an arrow point- 
ing towards it and one with an arrow pointing away from it, and one 
dashed line (whose arrow can point in either direction). Using this 
vertex, join up all the external lines, including extra internal lines as 
needed. In this way, draw all possible diagrams that are topologically 
inequivalent. 

8. Assign each internal line its own four-momentum. Think of the four- 
momenta as flowing along the arrows, and conserve four-momentum 
at each vertex. For a tree diagram, this fixes the momenta on all the 
internal lines. 


9. The value of a diagram consists of the following factors: 
for each incoming or outgoing scalar, 1; 
for each incoming electron, u Si (pi); 
for each outgoing electron, up (p(); 
for each incoming positron, u Si (pi); 
for each outgoing positron, vp (p(); 
for each vertex, ig\ 

for each internal scalar, — i/(k 2 + M 2 — ie); 

for each internal fermion, —i{—]j> + m)/(p 2 + m? — ie). 

10. Spinor indices are contracted by starting at one end of a fermion line: 
specifically, the end that has the arrow pointing away from the vertex. 
The factor associated with the external line is either u or v. Go along 
the complete fermion line, following the arrows backwards, and write 
down (in order from left to right) the factors associated with the 
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Figure 45.6: Diagrams for e + e — ► pp, corresponding to eq. (45.23). 


vertices and propagators that you encounter. The last factor is either 
a u or v. Repeat this procedure for the other fermion lines, if any. 

11. The overall sign of a tree diagram is determined by drawing all con- 
tributing diagrams in a standard form: all fermion lines horizontal, 
with their arrows pointing from left to right, and with the left end- 
points labeled in the same fixed order (from top to bottom); if the 
ordering of the labels on the right endpoints of the fermion lines in a 
given diagram is an even (odd) permutation of an arbitrarily chosen 
fixed ordering, then the sign of that diagram is positive (negative). 

12 . The value of VT (at tree level) is given by a sum over the values of 
the contributing diagrams. 


There are additional rules for counterterms and loops; in particular, each 
closed fermion loop contributes an extra minus sign. We will postpone 
discussion of loop corrections to section 51. 

Let us apply these rules to e + e _ — * (pip. Let the initial electron and 
positron have four-momenta pi and p 2 . respectively, and the two final 
scalars have four-momenta k[ and k 2 . The relevant diagrams are shown 
in fig. (45.6); there is only one fermion line, and so the relative sign is 
positive. The result is 


^e+e — - 


1 / • \2 _ ( \ ~i>\ + tfl + m , — i>\ +ft' 2 + m , \ 

= i\m ^ 2 (p2) — 1 . , 1 2 — + — , 9 «*i(pi)» 

—t + — U + 


(45.23) 

where t = —(pi — k [ ) 2 and u = —(pi — k' 2 ) 2 ■ 

Next, consider e _ e _ — ► e - e~. Let the initial electrons have four- 
momenta pi and p 2 , and the final electrons have four-momenta p'i and 
p 2 . The relevant diagrams are shown in fig. (45.7), and according to rule 
#11 the relative sign is negative. Thus the result is 




45: The Feynman Rules for Dirac Fields 


290 



Figure 45.8: Diagrams for e + e — > e + e , corresponding to eq. (45.25). 


where u i is short for w si (pi), etc., and t = —{p\ — Pi) 2 , u = — {pi — # 2 ) 2 . 

One more: e + e _ e + e _ . Let the initial electron and positron have 
four-momenta p± and p 2 , respectively, and the final electron and positron 
have four-momenta p' x and p' 2 , respectively. The relevant diagrams are 
shown in fig. (45.8). If we redraw them in the the standard form of rule 
#11, as shown in fig. (45.9), we see that the relative sign is negative. Thus 
the result is 


0 u{ui)(v 2 V2 ) 

-t + M 2 


(v2Ul)(u[v' 2 ) 
-u + M 2 



Figure 45.9: Same as fig. (45.8), but with the diagrams redrawn in the 
standard form given in rule #11. 
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where s = —(pi + P 2) 2 and t = —(pi — p'i) 2 - 

Problems 

45.1) a) Determine how p(x) must transform under parity, time reversal, 
and charge conjugation in order for these to all be symmetries of the 
theory. (Prerequisite: 40) 

b) Same question, but with the interaction given by L\ = igp'f’ 75 \H 
instead of eq. (45.1). 

45.2) Use the Feynman rules to write down (at tree level) iT for the pro- 
cesses e + e + — > e + e + and cpp — > e + e - . 
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46 Spin Sums 

Prerequisite: 45 


In the last section, we calculated various tree-level scattering amplitudes in 
Yukawa theory. For example, for e~<p e~ip we found 


T = g 2 u s '( p') 


—]j> — If + m —]) + If' + m 
—s + m 2 — u + m 2 


u s ( p) , 


(46.1) 


where s = — {p + k ) 2 and u = —(p — k') 2 . In order to compute the cor- 
responding cross section, we must evaluate |T| 2 = TT*. We begin by 
simplifying eq. (46.1) a little; we use (rf + m)u s ( p) = 0 to replace the in 
each numerator with m. We then abbreviate eq. (46.1) as 

T = u'Au , (46.2) 


where 

. o [— k + 2 m k! + 2 m 

A = g 2 ~2 + —o 

m A — s m z — u 


(46.3) 


Then we have 

T* = T = u'Au = uAv! , (46.4) 

where in general A = (3 A' (3, and, for the particular A of eq. (46.3), A = A. 
Thus we have 


|T| 2 = (• u'Au){uAu ') 

^ a-A a gUpU'yA^fiU£ l 
a(3^8 

^ ^ a A a f3U/3UryA.ryfi 

a/3jS 

= Tr {u'u')A{uu)A . (46.5) 

Next, we use a result from section 38: 

u s (p)u s {p) = + m) , (46.6) 

where s = =t tells us whether the spin is up or down along the spin quan- 
tization axis z. We then have 

|T| 2 = ±Tr (1 — s / 75^ / )(— / + m)A(l- rf + m)A . (46.7) 

We now simply need to take traces of products of gamma matrices; we will 
work out the technology for this in the next section. 
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However, in practice, we are often not interested in (or are unable to 
easily measure or prepare) the spin states of the scattering particles. Thus, 
if we know that an electron with momentum p' landed in our detector, but 
know nothing about its spin, we should sum \T\ 2 over the two possible spin 
states of this outgoing electron. Similarly, if the spin state of the initial 
electron is not specially prepared for each scattering event, then we should 
average \T\ 2 over the two possible spin states of this initial electron. Then 
we can use 

Y u s (p)u s (p) = + m (46.8) 

S=zb 

in place of eq. (46.6). 

Let us, then, take |T| 2 , sum over all final spins, and average over all 
initial spins, and call the result (|T| 2 ). In the present case, we have 

<iti 2 } = \ Y \ T \ 2 

s,s' 

= |Tr (—$' + m)A(—rf + m)A , (46.9) 

which is much less cumbersome than eq. (46.7). 

Next let’s try something a little harder, namely e + e _ — > e + e - . We 
found in section 45 that 


2 r K'Ui)(u 2 ^) {v2Ul){u[v' 2 y 

9 [ AP-t AP - s . ' 


(46.10) 


We then have 


~= = 2 \ (»l'»l)(^2^2) {UIV 2){v2 u 'l) 

9 [ AP-t AP — S 


(46.11) 


When we multiply T by T, we will get four terms. We want to arrange the 
factors in each of them so that every u and every v stands just to the left 
of the corresponding u and v. In this way, we get 

(A r i r i 

\T\ 2 = + , ^2 /j2 Tr uiuiu[u[ Tr 

g 4 r - - 1 r /-/ 

+ , ^2 s ,2 Tr u 1 u 1 v 2 V 2 Tr 

g A r - - - - 1 

- (Af2_ t)(M 2_ s) 

g ^ r till 1 

- (M 2 — s)(M 2 — t) ^[ u ^ u iuW2V2 v 2V 2 \ . (46.12) 
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Then we average over initial spins and sum over final spins, and use eq. (46.8) 
and 

^2 v s (p)v s (p) = ~i>- m . (46.13) 

S=± 

We then must evaluate traces of products of up to four gamma matrices. 
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47 Gamma Matrix Technology 

Prerequisite: 36 

In this section, we will learn some tricks for handling gamma matrices. We 


need the following information as a starting point: 


= -2<r , 

(47.1) 

75 = G 

(47.2) 

O 

II 

LO 

(47.3) 

Tr 1 = 4 . 

(47.4) 

Now consider the trace of the product of n gamma matrices. 

We have 


Tr[ 7 w . . . 7 ^ n ] = Tr[ 7 f 7 w 7 f . . . yf 7 Mn ] 

= Tr[(757 w 75) • • • ( 757 ^ 75 )] 

= TV[(- 75 V 1 ).--(-7 5 V”)] 

= (-l) n Tr[ 7 w ... 7 ^] . (47.5) 

We used eq. (47.2) to get the first equality, the cyclic property of the trace 
for the second, eq. (47.3) for the third, and eq. (47.2) again for the fourth. 
If n is odd, eq. (47.5) tells us that this trace is equal to minus itself, and 
must therefore be zero: 

Tr[odd # of 7 ^’s] = 0 . (47.6) 

Similarly, 

Trfys ( odd # of 7 ^’s ) ] = 0 . (47.7) 

Next, consider T ^[ 7 ^ 7 "]. We have 

Tr[ 7 G 1 = Tr[ 7 V] 

= ^ TV [ 7 ^ 7 ^ + 7 V] 

= -g^ Tr 1 

= -4 . (47.8) 

The first equality follows from the cyclic property of the trace, the second 
averages the left- and right-hand sides of the first, the third uses eq. (47.1), 
and the fourth uses eq. (47.4). 

A slightly nicer way of expressing eq. (47.8) is to introduce two arbitrary 
four- vectors and b^, and write 


Tr[^ = — 4(a6) 


(47.9) 
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where f, = a M 7 M , ) = 7 M , and (a&) = a^b^. 

Next consider Tr [(fiif<f:(/l] . We evaluate this by moving <jr to the right, using 
eq. (47.1), which is now more usefully written as 

</l) = — fj, — 2(ab) . (47.10) 


Using this repeatedly, we have 

Tr[^^f] = — Tr\ty</u/:(/l) — 2(ab)Tr[<^(/L] 

= +Tr[^j ijufl] + 2(ac)Tr[(f(l] — 2(a&)Tr[^j$] 

= — Tr[^^] — 2(ad)Tr[^] + 2(ac)Tr[^] — 2(a&)TV[jZ^] . (47.11) 

Now we note that the first term on the right-hand side of the last line is, 
by the cyclic property of the trace, equal to minus the left-hand side. We 
can then move this term to the left-hand side to get 


2Tr \/m\ = - 2(ad)Tr[^ + 2(ac)Tr[ftf] - 2(a6)Tr[^] . (47.12) 

Finally, we evaluate each Tr with eq. (47.9), and divide by two: 


Tr [</ifi4\ 


4 


(ad) (be) — (ac)(bd) + (ab)(cd) . 


(47.13) 


This is our final result for this trace. 

Clearly, we can use the same technique to evaluate the trace of the 
product of any even number of gamma matrices. 

Next, let’s consider traces that involve 75 ’s and 7^’s. Since {75, 7^} = 0, 
we can always bring all the 7s’s together by moving them through the 7 M ’s 
(generating minus signs as we go). Then, since 75 = 1, we end up with 
either one 75 or none. So we need only consider Tr[757 Ml ...7^"]. And, 
according to eq. (47.7), we need only be concerned with even n. 

Recall that an explicit formula for 75 is 

75 = i7°7 1 7 2 7 3 • (47.14) 

Eq. (47.13) then implies 

Tr 7 5 = 0 . (47.15) 

Similarly, we can show that 

Tr[757 / V] = 0 • (47.16) 


Finally, consider Tr^y^y^^ 0 "]. The only way to get a nonzero result is to 
have the four vector indices take on four different values. If we consider the 
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special case Tr[ 7 s 7 3 7 2 7 1 7 °], plug in eq. (47.14), and then use (y *) 2 = — 1 
and ( 7 0 ) 2 = 1, we get i(— l ) 3 Tr 1 = —4 i, or equivalently 

Tr[757 M 7 !y 7 P 7 CT ] = -4 ie^ pa , (47.17) 

where £ 0123 = e 3210 = + 1 . 

Another category of gamma matrix combinations that we will eventually 
encounter is 7 ^ji ... 7 ^. The simplest of these is 

7 m 7m = SV7 / V 

= -g,^r 

= -d. (47.18) 

To get the second equality, we used the fact that g^ w is symmetric, and 
so only the symmetric part of 7 ^ 7 ^ contributes. In the last line, d is the 
number of spacetime dimensions. Of course, our entire spinor formalism 
has been built around d = 4, but we will need formal results for d = 4— e 
when we dimensionally regulate loop diagrams involving fermions. 

We move on to evaluate 



'fill*. = 7 m (-7a^ - 2o m) 
= -7^7/^ - 
= {d—2)(/i . 

(47.19) 

We continue with 


= 4(a6) - (d-4)jf 

(47.20) 

and 


7 p 4^7m = 2 m + {d—4)ff<f : ; 

(47.21) 


the derivations are left as an exercise. 


Problems 


47.1) Verify eq. (47.16). 

47.2) Verify eqs. (47.20) and (47.21). 

47.3) Show that the most general 4x4 matrix can be written as a linear 

combination (with complex coefficients) of 1 , 7^, , 7^75, and 75, 

where 1 is the identity matrix and S^ u = | [ 7 ^ > 7 ^] • Hint: if A and B 
are two different members of this set, prove linear independence by 
showing that Tr B = 0 vanishes. Then count. 
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48 Spin-Averaged Cross Sections 

Prerequisite: 46, 47 


In section 46, we computed \T\ 2 for (among other processes) e + e _ — * e + e~. 
We take the incoming and outgoing electrons to have momenta p\ and p\, 
respectively, and the incoming and outgoing positrons to have momenta p -2 
and p' 2 , respectively. We have pf = pf = — m 2 , where m is the electron 
(and positron) mass. The Mandelstam variables are 

s = — (Pi +P2? = -(pi +P2 f , 
t = -(pi - Pi) 2 = (P2 P 2) 2 , 

« = — (Pi - Pa) 2 = ~(P2 - Pi) 2 , (48.1) 


and they obey s + t + u = 4m 2 . Our result was 


m 2 = 5 4 


4>, 


$st + *h 


ts 


+ 


4> 


tt 


(M 2 — s) 2 (M 2 — s){M 2 — t) (M 2 — t) 2 \ 


(48.2) 


where M is the scalar mass, and 


$ss = Tr 




Tr 


V 2 V 2 U \ u \ 


&tt = Tr 
$st = Tr 
$ts = Tr 


UlUlUiu[ 


Tr 


V2V2V2V2 


UlUlUiu[v' 2 V 2 V2V2 


UlUlV2V2V 2 V 2 U , 1 u[ 


(48.3) 


Next, we average over the two initial spins and sum over the two final 
spins to get 


Then we use 

X! u s (p)u s (p) = -rf + m , 

S=± 

(48.4) 

to get 

v s{P)vs(p) = -rf-m, 

S=d= 

(48.5) 

<$«> 

= |TV (-A+ m )(“A _m ) Tr (-A“ m )(-A+ m ) * 

(48.6) 

<*tt> 

= |TV (-^ 1 +m)(-$+m) Tr (-^-m)(-^ 2 -m) , 

(48.7) 

(**t) 

= i Tr (-^i+m)(-^+m)(-|l2-m)(-^ 2 -m) , 

(48.8) 

(*ts) 

= |Tr [(— ^ 1 +m)(— A - "r)(— ^2 _ m )( _ • 

(48.9) 
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It is now merely tedious to evaluate these traces with the technology of 
section 47. 

For example, 

TV (— ^!+m)(— rf 2 — tti) = TV^^] — in 2 Tr 1 

= —4(pip-2) — 4 m 2 , (48.10) 

It is convenient to write four-vector products in terms of the Mandelstam 
variables. We have 

P 1 P 2 = P\P 2 = -|(« - 2m 2 ) , 
pip'i = P2P2 = +U t ~ 2m2 ) > 

P 1 P 2 = P 1 P 2 = - 2m 2 ) , (48.11) 

and so 

Tr (— ^ 1 +m)(— 1 ) 2 — m) = 2s — 8m 2 . (48.12) 

Thus, we can easily work out eqs. (48.6) and (48.7): 

(<4> ss ) = {s- 4 m 2 ) 2 , (48.13) 

<$tt> = (t - 4m 2 ) 2 . (48.14) 

Obviously, if we start with (<5 SS ) and make the swap s t, we get 
We could have anticipated this from eqs. (48.6) and (48.7): if we start with 
the right-hand side of eq. (48.6) and make the swap P 2 —p\ , we get 
the right-hand side of eq. (48.7). But from eq. (48.11), we see that this 
momentum swap is equivalent to s t. 

Let’s move on to (3> st ) and These two are also related by P 2 

—p'l, and so we only need to compute one of them. We have 

(®st) = |Tr^^ 2 ] 

+ \m 2 Tr[^ - - #^2 “ 2^2 + ^ 2 ] + I™ 4 Tr 1 

= {PiPi)(P 2 P 2 ) ~ {piP 2 ){P 2 Pi) + (P1P2W1P2) 

- m 2 [pxp\ - P 1 P 2 - P 1 P 2 ~ P 1 P 2 - P 1 P 2 + P 2 P 2 ] + m 4 
= — \st + 2m 2 u . (48.15) 

To get the last line, we used eq. (48.11), and then simplified it as much as 
possible via s + t + u = 4m 2 . Since our result is symmetric on s <-> t, we 
have (<F ts ) = ($ st ). 

Putting all of this together, we get 

(s — 4m 2 ) 2 t st — 4m 2 u | ( t — Am 2 ) 2 
(M 2 - s) 2 + l M 2 - s)(M 2 -t) + (Af 2 - t ) 2 


(l^| 2 )=/ 


(48.16) 
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This can then be converted to a differential cross section (in any frame) via 
the formulae of section 11. 

Let’s do one more: e~ip — > e~(p. We take the incoming and outgoing 
electrons to have momenta p and p ' , respectively, and the incoming and 
outgoing scalars to have momenta k and k ! , respectively. We then have 
p 2 = p' 2 = — m 2 and k 2 = k' 2 = —M 2 . The Mandelstam variables are 

s = — (p + k) 2 = -(p + k') 2 , 
t = -(■ p-p) 2 = -( k - k') 2 , 
u = —(jp — k') 2 = —{k — p') 2 , (48.17) 


and they obey s + t + u = 2 m 2 + 2 M 2 . Our result in section 46 was 



(|T| 2 ) = ±Tr A(-$ + m)A { -/ + m) , 

(48.18) 

where 

. o [— k + 2 m k! + 2m 

a = 9 — 2 + — 2 • 

m z — s — u 

(48.19) 

Thus we 

i have 



2\ 4 (^ss) . (4?su) + (^us) . (^u«) 

^ [ ( m 2 — s) 2 (m 2 — s){m 2 — u ) ( m 2 — u) 2 \ ’ 

(48.20) 

where now 



(<f> ss ) = |Tr (— ^ , +m)(-#+2m)(-^+m)(-#+2ra) , 

(48.21) 


{®uu} = 5 Tr (-/ +m)(+ft' +2m,)(-rf+m)(+ljk' +2m) , 

(48.22) 


{$su) = |Tr (-/ +m)(-l/+2m)(-tf-\-m)(+l/' +2m) , 

(48.23) 


($ us ) = |Tr (-/+m)(+^ / +2rn)(-^+m)(-^+2m) . 

(48.24) 

We can evaluate these in terms of the Mandelstam variables by using our 
trace technology, along with 


pk = pk' = —^(s — m 2 — M 2 ) , 



pp = +\{t- 2m 2 ) , 
kk! = +±(t - 2M 2 ) , 



CM 

1 

CM 

1 

i-H|CM 

+ 

II 

-S£ 

"a, 

II 

Tse 

a, 

(48.25) 


Examining eqs. (48.21) and (48.22), we see that (<3? ss ) and (3> uu ) are trans- 
formed into each other by k —k'. Examining eqs. (48.23) and (48.24), we 
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see that (3> su ) and (<h us ) are also transformed into each other by k <-> —k!. 
From eq. (48.25), we see that this is equivalent to s u. Thus we need 
only compute (3> ss ) and (<& su ), and then take s««to get ($„„) and (4> us ). 
This is, again, merely tedious, and the results are 


(®ss) 

= —su + m 2 (9s + u) 

+ 7m 4 

- 8 m 2 M 2 + M 4 , 

(48.26) 

($uu) 

= — su + m 2 (9u + s ) 

+ 7m 4 

- 8 m 2 M 2 + M 4 , 

(48.27) 

{$su) 

= +su + 3 m 2 (s + u) 

+ 9 m 4 

- 8 m 2 M 2 - M 4 , 

(48.28) 

{$us) 

= +su + 3 m 2 (s + u) 

+ 9m 4 

- 8m 2 M 2 - Af 4 . 

(48.29) 


Problems 




48.1) The tedium of these calculations is greatly alleviated by making use of 
a symbolic manipulation program like Mathematica or Maple. One 
approach is brute force: compute 4x4 matrices like in the CM 
frame, and take their products and traces. If you are familiar with a 
symbolic-manipulation program, write one that does this. See if you 
can verify eqs. (48.26-48.29). 

48.2) Compute (|T| 2 ) for e + e“ — ► (ftp . You should find that your result 
is the same as that for e~p — > e~ g>, but with s t, and an extra 
overall minus sign. This relationship is known as crossing symmetry. 
There is an overall minus sign for each fermion that is moved from 
the initial to the final state. 

48.3) Compute (|T| 2 ) for e _ e _ — > e _ e“. You should find that your result is 
the same as that for e + e~ — > e + e _ , but with s u. This is another 
example of crossing symmetry. 

48.4) Suppose that M > 2m, so that the scalar can decay to an electron- 
positron pair. 

a) Compute the decay rate, summed over final spins. 

b) Compute |T| 2 for decay into an electron with spin si and a positron 
with spin S 2 - Take the fermion three- momenta to be along the z axis, 
and let the x-axis be the spin-quantization axis. You should find 
that |T| 2 = 0 if si = — S 2 , or if M = 2 m (so that the outgoing 
three-momentum of each fermion is zero). Discuss this in light of 
conservation of angular momentum and of parity. (Prerequisite: 40.) 

c) Compute |T| 2 for decay into an electron with helicity si and a 
positron with helicity s 2 - (See section 38 for the definition of helicity.) 
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You should find that the decay rate is zero if si = — s 2 - Discuss this 
in fight of conservation of angular momentum and of parity. 

d) Now consider changing the interaction to C\ = igipty 75 \H, and com- 
pute the spin-summed decay rate. Explain (in light of conservation of 
angular momentum and of parity) why the decay rate is larger than 
it was without the i 75 in the interaction. 

e) Repeat parts (b) and (c) for the new form of the interaction, and 
explain any differences in the results. 

48.5) The charged pion 7 is represented by a complex scalar field ip, the 
muon g~ by a Dirac field M, and the muon neutrino v tl by a spin- 
projected Dirac field P L Af, where P L = 1(1—75). The charged pion 
can decay to a muon and a muon antineutrino via the interaction 

C\ = 2ciGpf 7t d ll ipM^Pi J M + h.c. , (48.30) 

where c\ is the cosine of the Cabibbo angle, Gp is the Fermi constant, 
and f n is the pion decay constant. 

a) Compute the charged pion decay rate T. 

b) The charged pion mass is m n = 139.6 MeV, the muon mass is 

= 105.7 MeV, and the muon neutrino is massless. The Fermi 
constant is measured in muon decay to be Gp = 1.166 x 10 -5 GeV -2 , 
and the cosine of the Cabibbo angle is measured in nuclear beta decays 
to be ci = 0.974. The measured value of the charged pion lifetime is 
2.603 x 10~ 8 s. Determine the value of f w in MeV. Your result is too 
large by 0.8%, due to neglect of electromagnetic loop corrections. 
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49 The Feynman Rules for Majorana Fields 

Prerequisite: 45 

In this section we will deduce the Feynman rules for Yukawa theory, but 
with a Majorana field instead of a Dirac field. We can think of the particles 
associated with the Majorana field as massive neutrinos. 

We have 


Ci = \g<pW 

= ^'F T C’F , (49.1) 

where 4/ be a Majorana field (with mass m ) , ip is a real scalar field (with 
mass M ), and g is a coupling constant. In this section, we will be concerned 
with tree-level processes only, and so we omit renormalizing Z factors. 

From section 41, we have the LSZ rules appropriate for a Majorana 
field, 


b\( P)in - 

-> — i J d 4 x e +ipx v s (p){—i$ + rri)^ (x) 

(49.2) 


= +i J d 4 x 4/ T (x)C(+i<jD +m)u s ( p)e +ipx , 

(49.3) 

bs' (p )out — 

-> +i J d 4 x e~ ip ' x u s i(p')(—i^ + 7n)'F(x) , 

(49.4) 


= -ij d 4 x e~ ip ' x ^ T (x)C(+?:J + m)u s /(p / )e- ip ' a: . 

(49.5) 


Eq. (49.3) follows from eq. (49.2) by taking the transpose of the right-hand 
side, and using rv(p / ) T = —Cu s >(p') and (— + m) T = C(+iijD + m)C~ l \ 
similarly, eq. (49.5) follows from eq. (49.4). Which form we use depends on 
convenience, and is best chosen on a diagram-by-diagram basis, as we will 
see shortly. 

Eqs. (49.2-49.5) lead us to compute correlation functions containing ^’s, 
but not Ji’s. In position space, this leads to Feynman rules where the 
fermion propagator is 4 S(x — r/)C _1 , and the <y 2 'F'F vertex is igC\ the factor 
of 77 in C\ is canceled by a symmetry factor of 2! that arises from having 
two identical 4/ fields in C\. In a particular diagram, as we move along 
a fermion line, the C~ 1 in the propagator will cancel against the C in the 
vertex, leaving over a final C~ l at one end. This C -1 can be canceled by 
a C from eq. (49.3) (for an incoming particle) or eq. (49.5) (for an outgoing 
particle). On the other hand, for the other end of the same line, we should 
use either eq. (49.2) (for an incoming particle) or eq. (49.4) (for an outgoing 
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particle) to avoid introducing an extra C at that end. In this way, we can 
avoid ever having explicit factors of C in our Feynman rules. 1 

Using this approach, the Feynman rules for this theory are as follows. 

1. The total number of incoming and outgoing neutrinos is always even; 
call this number 2 n. Draw n solid lines. Connect them with internal 
dashed lines, using a vertex that joins one dashed and two solid lines. 
Also, attach an external dashed line for each incoming or outgoing 
scalar. In this way, draw all possible diagrams that are topologically 
inequivalent. 

2. Draw arrows on each segment of each solid line; keep the arrow di- 
rection continuous along each line. 

3. Label each external dashed line with the momentum of an incoming 
or outgoing scalar. If the particle is incoming, draw an arrow on the 
dashed line that points towards the vertex; If the particle is outgoing, 
draw an arrow on the dashed line that points away from the vertex. 

4. Label each external solid line with the momentum of an incoming or 
outgoing neutrino, but include a minus sign with the momentum if (a) 
the particle is incoming and the arrow points away from the vertex, or 
(b) the particle is outgoing and the arrow points towards the vertex. 
Do this labeling of external lines in all possible inequivalent ways. 
Two diagrams are considered equivalent if they can be transformed 
into each other by reversing all the arrows on one or more fermion 
lines, and correspondingly changing the signs of the external momenta 
on each arrow-reversed line. 

5. Assign each internal line its own four-momentum. Think of the four- 
momenta as flowing along the arrows, and conserve four-momentum 
at each vertex. For a tree diagram, this fixes the momenta on all the 
internal lines. 

6. The value of a diagram consists of the following factors: 
for each incoming or outgoing scalar, 1; 

for each incoming neutrino labeled with +pi, u Si (pj); 
for each incoming neutrino labeled with —pi, u Si (pj); 
for each outgoing neutrino labeled with +p', Tty (p)); 
for each outgoing neutrino labeled with — p[, vy (p)); 

x This is not always possible if the Majorana fields interact with Dirac fields, and we 
use the usual rules for the Dirac fields. 
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Figure 49.1: Two equivalent diagrams for ip — > vv. 
for each vertex, ig; 

for each internal scalar, — i/{k 2 + M 2 — ze); 

for each internal fermion, + m)/(p 2 + m 2 — ie). 

7. Spinor indices are contracted by starting at one end of a fermion 
line: specifically, the end that has the arrow pointing away from the 
vertex. The factor associated with the external line is either u or v. 
Go along the complete fermion line, following the arrows backwards, 
and writing down (in order from left to right) the factors associated 
with the vertices and propagators that you encounter. The last factor 
is either a u or v. Repeat this procedure for the other fermion lines, 
if any. 

8. The overall sign of a tree diagram is determined by drawing all con- 
tributing diagrams in a standard form: all fermion lines horizontal, 
with their arrows pointing from left to right, and with the left end- 
points labeled in the same fixed order (from top to bottom); if the 
ordering of the labels on the right endpoints of the fermion lines in a 
given diagram is an even (odd) permutation of an arbitrarily chosen 
fixed ordering, then the sign of that diagram is positive (negative). To 
compare two diagrams, it may be necessary to use the arrow-reversing 
equivalence relation of rule #4; there is then an extra minus sign for 
each arrow-reversed line. 

9. The value of iT is given by a sum over the values of all these diagrams. 

There are additional rules for counterterms and loops, but we will postpone 
those to section 51. 

Let’s look at the simplest process, — > vv. There are two possible 

diagrams for this, shown in fig. (49.1). However, according to rule #4, 
these two diagrams are equivalent, and we should keep only one of them. 
The first diagram yields iT\ = igv 2 u'i and the second zT 2 = igv[u 2 . Rule 
#8 then implies that we should have T\ = — 75. To check this, we note that 
(after dropping primes to simplify the notation) 


v\ u 2 = [vi u 2 ] t 
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Figure 49.2: Diagrams for — ► i/z/, corresponding to eq. (49.7). 


T — T 

= u 2 v 1 
= V 2 C~ l C~ X U\ 

= —V 2 U 1 , (49.6) 


as required. 

In general, for processes with a total of just two incoming and outgo- 
ing neutrinos, such as nip — ► nip or nn —> (pip, these rules give (up to an 
irrelevant overall sign) the same result for iT as we would get for the cor- 
responding process in the Dirac case, e~ip — > e~ip or e + e~ — > ipip. (Note, 
however, that in the Dirac case, we have C\ = , as compared with 

C\ = \gip x FH/ in the Majorana case.) 

The differences between Dirac and Majorana fermions become more 
pronounced for nn — > nn. Now there are three inequivalent contributing 
diagrams, shown in fig. (49.2). The corresponding amplitude can be written 
as 


iT 



( ujui){u2U 2 ) 

-t + M 2 


{u2Ui){u[u 2 ) (v2Ul){u[v2) ' 

-u + M 2 — s + M 2 J ’ 


(49.7) 


where s = —('P 1 +P 2 ) 2 , t = —{pi—p'i) 2 and u = — (pi— p' 2 ) 2 ■ After arbitrarily 
assigning the first diagram a plus sign, the minus sign of the second diagram 
follows from rule #8. To get the sign of the third diagram, we compare it 
with the first. To do so, we reverse the arrow direction on the lower line 
of the first diagram (which yields an extra minus sign), and then redraw 
it in standard form. Comparing this modified first diagram with the third 
diagram (and invoking rule #8) reveals a relative minus sign. Since the 
modified first diagram has a minus sign from the arrow reversal, we conclude 
that the third diagram has an overall plus sign. 

After taking the absolute square of eq. (49.7), we can use relations like 
eq. (49.6) on a term-by-term basis to put everything into a form that allows 
the spin sums to be performed in the standard way. In fact, we have already 
done all the necessary work in the Dirac case. The s-s, s-t, and t-t terms in 
(|Tj 2 ) for nn — > nn are the same as those for e + e“ — > e + e - , while the t-t, 
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t-u, and u-u terms are the same as those for the crossing-related process 
e~ e~ — > e _ e _ . Finally, the s-u terms can be obtained from the s-t terms 
via t u, or equivalently from the t-u terms via t s. Thus the result is 


(l^l 2 ) = 9 4 


+ 


+ 


(s — Am 2 ) 2 
(M 2 - s) 2 

(t — Am 2 ) 2 
l M 2 - t) 2 

(u — Am, 2 ) 2 
(M 2 - u ) 2 


+ 


+ 


+ 


st — Am 2 u 
(M 2 - s)(M 2 - t) 

tu — Am 2 s 
(M 2 - t)(M 2 - u) 

us — Am 2 t 
(M 2 - u) ( Af 2 - s) 


(49.8) 


which is neatly symmetric on permutations of s, t, and u. 


Problems 


49.1) Let f be a Dirac field (representing the electron and positron), X 
be a Majorana field (represeting the photino , the hypothetical super- 
symmetric partner of the photon, with mass rn- f ) , and E L and E R be 
two different complex scalar fields (representing the two selectrons, 
the hypothetical supersymmetric partners of the left-handed electron 
and the right-handed electron, with masses M L , and M R ; note that 
the subscripts L and R are just part of their names, and do not sig- 
nify anything about their Lorentz transformation properties). They 
interact via 


Cl = V2 eElXP^ + y/2eE&XP n 'f! + h.c. , (49.9) 

where a = e 2 /47r ~ 1/137 is the fine-structure constant, and P LiR = 
\^FTb)- 

a) Write down the hermitian conjugate term explicitly. 

b) Find the tree-level scattering amplitude for e + e~ — » 77. Hint: 
there are four contributing diagrams, two each in the t and u channels, 
with exchange of either E L or E R . 

c) Compute the spin-averaged differential cross section for this process 
in the case that m e (the electron mass) can be neglected, and |i|, |u| <C 
M l = M r . Express it as a function of s and the center-of-mass 
scattering angle 9. 
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50 Massless Particles and Spinor Helicity 

Prerequisite: 48 


Scattering amplitudes often simplify greatly if the particles are massless 
(or can be approximated as massless because the Mandelstam variables all 
have magnitudes much larger than the particle masses squared). In this 
section we will explore this phenomenon for spin-one-half (and spin-zero) 
particles. We will begin developing the technology of spinor helicity , which 
will prove to be of indispensible utility in Part III. 

Recall from section 38 that the u spinors for a massless spin-one-half 
particle obey 

u s {p)u s (p) = |(1 + s75)(-$0 , (50.1) 

where s = ± specifies the helicity, the component of the particle’s spin 
measured along the axis specified by its three-momentum; in this notation 
the helicity is |s. The v spinors obey a similar relation, 

v s (p)v s (p) = ±(1 - s 7 5 )(-$f) . (50.2) 


In fact, in the massless case, with the phase conventions of section 38, 
we have u s (p) = u_ s (p). Thus we can confine our discussion to it- type 
spinors only, since we need merely change the sign of s to accomodate 
u-type spinors. 

Consider a u spinor for a particle of negative helicity. We have 


«-(p)«-(p) = H 1 ~ 75)(-l0 • 


Let us define 
Then we also have 
Then, using 


Pad, — Pn a aa • 

e ac e™Pcc = P^ ha • 


7 m = 




W-ib) = 




1 O' 

0 0, 


in eq. (50.3), we find 


(50.3) 

(50.4) 

(50.5) 


(50.6) 


u-(p)u-{p) = 


' 0 -Pad ' 

0 0 


On the other hand, we know that the lower two components of 
vanish, and so we can write 


(50.7) 

M p) 


it_(p) = 



(50.8) 
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Here 0 a is a two-component numerical spinor; it is not an anticommuting 
object. Such a commuting spinor is sometimes called a twistor. An explicit 
numerical formula for it (verified in problem 50.2) is 


0a 


y/2ih 


-sin(i0)e-^\ 
+ cos(i0) ) 


(50.9) 


where 6 and 0 are the polar and azimuthal angles that specify the direction 
of the three-momentum p, and uj = |p|. Barring eq. (50.8) yields 


«- (P) = ( 0, 0d ) , 


(50.10) 


where 01 = (0 a )*. Now, combining eqs. (50.8) and (50.10), we get 


ii_(p)u_(p) 



Comparing with eq. (50.7), we see that 


(50.11) 


Pad — 0a0<j • 


(50.12) 


This expresses the four-momentum of the particle neatly in terms of the 
twistor that describes its spin state. The essence of the spinor helicity 
method is to treat 0 a as the fundamental object, and to express the parti- 
cle’s four-momentum in terms of it, via eq. (50.12). 

Given eq. (50.8), and the phase conventions of section 38, the positive- 
helicity spinor is 

U +(P)=( r a)’ ( 50 - 13 ) 

where 0* a = £ ac 0t. Barring eq. (50.13) yields 

u+(p) = (0°, 0). (50.14) 


Computation of n + (p)tt+ (p) via eqs. (50.13) and (50.14), followed by com- 
parison with eq. (50.1) with s = +, then reproduces eq. (50.12), but with 
the indices raised. 

In fact, the decomposition of p a a into the direct product of a twistor 
and its complex conjugate is unique (up to an overall phase for the twistor). 
To see this, use cr M = (/, a) to write 


/ — p° + p 3 
\ P 1 + ip 2 


1 • 2 \ 
P ~ ip \ 

—p° — p 3 J 


(50.15) 


The determinant of this matrix is — (p 0 ) 2 + p 2 , and this vanishes because 
the particle is (by assumption) massless. Thus p a d has a zero eigenvalue. 
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Therefore, it can be written as a projection onto the eigenvector correspond- 
ing to the nonzero eigenvalue. That is what eq. (50.12) represents, with the 
nonzero eigenvalue absorbed into the normalization of the eigenvector cj) a . 

Let us now introduce some useful notation. Let p and k be two four- 
momenta, and 4> a and n a the corresponding twistors. We define the twistor 
product 

{pk\ = (f) a Ka. (50.16) 

Because 4> a K a = s ac 4> c n a , and the twistors commute, we have 

[kp] = -\pk]. (50.17) 

From eqs. (50.8) and (50.14), we can see that 

u+(p)«_(k) = \pk\ . (50.18) 

Similarly, let us define 

(pfc) = . (50.19) 

Comparing with eq. (50.16) we see that 

{pk) = [kp}*, (50.20) 

which implies that this product is also antisymmetric, 

(kp) = -(pk). (50.21) 

Also, from eqs. (50.10) and (50.13), we have 

u_(p)«_l_(k) = (pk) . (50.22) 

Note that the other two possible spinor products vanish: 

u + (p)u + (k) = u_(p)u_(k) = 0 . (50.23) 

The twistor products {p k) and \p k] satisfy another important relation, 

(pk)[kp\ = {(plK* a )(K a (pa) 

= (, i>lct> a ){K a K * h ) 

= Paa^ 

= -2p^k^ , 

where the last line follows from u^ aa (jL = — 2g ,iu . 


(50.24) 
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Let us apply this notation to the tree-level scattering amplitude for 
e~p — > e~ p in Yukawa theory, which we first computed in Section 44, and 
which reads 


X'.s = 


= g 2 u s 


(P 


S(p+k) + S(p-k') 


u s { Pj 


(50.25) 


For a massless fermion, S(p) = — j/)/p 2 . If the scalar is also massless, then 
( p + k ) 2 = 2 p-k and ( p — k / ) 2 = — 2p • k! . Also, we can remove the fs in the 
propagator numerators in eq. (50.25), because rfu s ( p) = 0. Thus we have 


X's — 9" u s > (p ) 


2 p-k 


+ 


-r 

2p-k ' 


u s { P) • 


(50.26) 


Now consider the case s' = s = +. From eqs. (50.13), (50.14), and 


-{6=( ° K ° Kd ) , 

V K* a K a 0 J 

we get 

u+(p')(-ft)u+(p) = ^' a K a <</>* a 
= [p' k\ ( kp ) . 

Similarly, for s' = s = — , we find 

K* a K a (/) a 

= ( p'k ) [kp\ , 


while for s' X s, the amplitude vanishes: 


«-(p')(-^)«+(p) = u + (p')(-l/t)u-(p) = 0 . 


(50.27) 


(50.28) 


(50.29) 


(50.30) 


Then, using eq. (50.24) on the denominators in eq. (50.26), we find 


while 


T++ = -1 


71_ = 


\p'k] \p'k'}\ 

\pk ] \pk '])’ 


(p'k) ( p'k')\ 

{pk) { pk ') J ’ 

(50.31) 

T-+ = 0 . 

(50.32) 


Thus we have rather simple expressions for the fixed-helicity scattering 
amplitudes in terms of twistor products. 
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Reference Notes 
Spinor-helicity methods are discussed by Siegel. 

Problems 

50.1) Consider a bra-ket notation for twistors, 

I P] = u-(p) = v+(p) , 

I p) = U+(p) = u_(p) , 

\p\ = «+(p) = «-(p) , 

(p| = «-(p) = w+(p) . 

We then have 

{k\ | p) = {kp} , 

[k\ \p] = [kp], 

(k\ \p] = 0 , 

[k\ \p) = 0 . 

a) Show that 

~i>= \p)[p\ + \p\{p\ , 

where p is any massless four-momentum. 

b) Use this notation to rederive eqs. (50.28-50.30). 

50.2) a) Use eqs. (50.9) and (50.15) to verify eq. (50.12). 

b) Let the three-momentum p be in the +z direction. Use eq. (38.12) 
to compute tt±(p) explicitly in the massless limit (corresponding to 
the limit i j —> oo, where sinh?? = \p\/m). Verify that, when 9 = 0, 
your results agree with eqs. (50.8), (50.9), and (50.13). 

50.3) Prove the Schouten identity , 

(p q) (r s) + (p r) (s q) + (p s) (q r) = 0 . (50.36) 


(50.33) 

(50.34) 

(50.35) 


Hint: note that the left-hand side is completely antisymmetric in the 
three labels q, r, and s, and that each corresponding twistor has only 
two components. 
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50.4) Show that 

(pq) [qr] (r s) [sp] = Tr±(l- 7 5 )$^ , 

(50.37) 

and evaluate the right-hand side. 


50.5) a) Prove the useful identities 


(p\^\k] = [k\^\p) , 

(50.38) 

(p\Y l \k}* = {k\^\p\ , 

(50.39) 

(p\l^\p\ = 2 p^ , 

(50.40) 

<p|t m I k) = o , 

(50.41) 

\p\Y\k\ = 0 . 

(50.42) 

b) Extend the last two identies of part (a): show that the product 
of an odd number of gamma matrices sandwiched between either (p\ 
and |jfe) or [p| and |&] vanishes. Also show that the product of an even 
number of gamma matrices between either ( p\ and \k] or [p and \k) 
vanishes. 

c) Prove the Fierz identities, 


-\{p\iMy i = I?KpI + b) [? 1 > 

(50.43) 

-\\p\iM)Y l = \q)\p\ + \p](q\ ■ 

(50.44) 

Now take the matrix element of eq. (50.44) between (r 
another useful form of the Fierz identity, 

and \s] to get 

Ipl^lq) (r| 7 M |s] = 2 [ps\ (qr) . 

(50.45) 
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51 Loop Corrections in Yukawa Theory 

Prerequisite: 19, 40, 48 

In this section we will compute the one-loop corrections in Yukawa theory 
with a Dirac field. The basic concepts are all the same as for a scalar field, 
and so we will mainly be concerned with the extra technicalities arising 
from spin indices and anticommutation. 

First let us note that the general discussion of sections 18 and 29 leads 
us to expect that we will need to add to the lagrangian all possible terms 
whose coefficients have positive or zero mass dimension, and that respect 
the symmetries of the original lagrangian. These include Lorentz symmetry, 
the U(l) phase symmetry of the Dirac field, and the discrete symmetries 
of parity, time reversal, and charge conjugation. 

The mass dimensions of the fields (in four spacetime dimensions) are 
[p\ = 1 and [T] = |. Thus any power of p up to p 4 is allowed. But there 
are no additional required terms involving \k: the only candidates contain 
either 75 (e.g., z 'T 7/5 VP ) and are forbidden by parity, or C (e.g, T T C'F) and 
are forbidden by the U(l) symmetry. 

Nevertheless, having to deal with the addition of three new terms ( p , 
p 3 , p 4 ) is annoying enough to prompt us to look for a simpler example. 


Consider, then, a modified form of the Yukawa interaction, 

-Cyuk = w'h 75 T . (51.1) 

This interaction will conserve parity if and only if <p is a pseudoscalar: 

P~ 1 p(x, t)P = — p(— x, t) . (51-2) 

Then, p and p 3 are odd under parity, and so we will not need to add them 
to C. The one term we will need to add is p 4 . 

Therefore, the theory we will consider is 

C = C 0 + C 1 , (51.3) 

C 0 = - mW - l 2 d IJ pd lt p - \M 2 p 2 , (51.4) 

C\ = iZggp'Lys'L - ^ Z\\p 4 + C c t , (51.5) 

C ct = i(Zy- l)f^f - (Z m -l)mM 

- \(Z v -l)d^pdgp - \{Z m -1)M 2 p 2 (51.6) 


where A is a new coupling constant. We will use an on-shell renormaliza- 
tion scheme. The lagrangian parameter m is then the actual mass of the 
electron. We will define the couplings g and A as the values of appropri- 
ate vertex functions when the external four-momenta vanish. Finally, the 
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fields are normalized according to the requirements of the LSZ formula. In 
practice, this means that the scalar and fermion propagators must have 
appropriate poles with unit residue. 

We will assume that M < 2 m, so that the scalar is stable against decay 
into an electron-positron pair. The exact scalar propagator (in momentum 
space) can be then written in Lehmann-Kallen form as 


A (k 2 ) 


1 

k 2 + M 2 - ie 



P(s) 

k 2 + s — ie 


(51.7) 


where the spectral density p(s) is real and nonnegative. The threshold mass 
M t h is either 2m (corresponding to the contribution of an electron-positron 
pair) or 3 M (corresponding to the contribution of three scalars; by parity, 
there is no contribution from two scalars), whichever is less. 

We can also write 


A ( fc 2 )” 1 = k 2 + M 2 - ie - U{k 2 ) , (51.8) 


where ill(k 2 ) is given by the sum of one-particle irreducible (1PI for short; 
see section 14) diagrams with two external scalar lines, and the external 
propagators removed. The fact that A (k 2 ) has a pole at k 2 = —M 2 with 
residue one implies that II (— M 2 ) = 0 and lT(— M 2 ) = 0; this fixes the 
coefficients and Zm ■ 

All of this is mimicked for the Dirac field. When parity is conserved, the 
exact propagator (in momentum space) can be written in Lehmann-Kallen 
form as 


sm = + r ds =irM±ZIMii , 

rr + m z — ie J m 2 p z + s — ie 

x th 


(51.9) 


where the spectral densities pi(s) and P 2 ( s ) are both real, and p\(s) is non- 
negative and greater than p 2 (s). The threshold mass m t h is m + M (corre- 
sponding to the contribution of a fermion and a scalar), which, by assump- 
tion, is less than 3 m (corresponding to the contribution of three fermions; 
by Lorentz invariance, there is no contribution from two fermions). 

Since p 2 = we can rewrite eq. (51.9) as 


SM = 


+ m — ie 


+ 


ds 


1 2 

\h 


-rfpijs) + y/ip2(g) 

{-i + \fs ~ + yfs - ie) 


(51.10) 


with the understanding that l/(. . .) refers to the matrix inverse. However, 
since ]J is the only matrix involved, we can think of S(^) as an analytic 
function of the single variable With this idea in mind, we see that S(^) 
has an isolated pole at = —m with residue one. This residue corresponds 
to the field normalization that is needed for the validity of the LSZ formula. 
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Figure 51.1: The one-loop and counterterm corrections to the scalar prop- 
agator in Yukawa theory. 


We can also write the exact fermion propagator in the form 

SfiCr 1 = + m - ie - , (51.11) 


where i£(^) is given by the sum of 1PI diagrams with two external fermion 
lines, and the external propagators removed. The fact that S(^) has a pole 
at ]j> = —m with residue one implies that £(— m) = 0 and = 0; this 

fixes the coefficients Zq, and Z m . 

We proceed to the diagrams. The Yukawa vertex carries a factor of 
i(iZ g g ) 75 = —Z g g'j 5 . Since Z g = 1 + 0(g 2 ), we can set Z g = 1 in the 
one-loop diagrams. 

Consider first n(fc 2 ), which receives the one-loop (and counterterm) 
corrections shown in fig. (51.1). The first diagram has a closed fermion loop. 
As we will see in problem 51.1 (and section 53), anticommutation of the 
fermion fields results in an extra factor of minus one for each closed fermion 
loop. The spin indices on the propagators and vertices are contracted in 
the usual way, following the arrows backwards. Since the loop closes on 
itself, we end up with a trace over the spin indices. Thus we have 


*n^i oop (A: 2 ) = (— 1)(— g) 2 (i) 2 


dH 


Tr 


S(^)75%)75 


(51.12) 


where 

—rf + m 
p 2 + m 2 — ie 

is the free fermion propagator in momentum space. 
We now proceed to evaluate eq. (51.12). We have 




(51.13) 


Tr[(— / - # + m)7 5 (-/ + 771)75] = Tr[(-/ - # + m)(+f + m)] 

= 4[(£ + k)£ + m 2 ] 


= AN . 


(51.14) 
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The first equality follows from = 1 and 75/75 = — /. 

Next we combine the denominators with Feynman’s formula. Suppress- 
ing the ie’s, we have 


i l = f 1 dx — - - 

( l+k ) 2 + m 2 t 2 + m 2 7o (c/ 2 + -D) 2 ’ 


(51.15) 


where q = £ + xk and D = x(l—x)k 2 + m 2 . 

We then change the integration variable in eq. (51.12) from t to q\ the 
result is 

9 f 1 f d% N 

ihlmoop{k 2 ) = 4 g 2 ^ dx J {q2 + D)2 , (51-16) 

where now N = (q + (l—x)k)(q — xk) + m 2 . The integral diverges, and so we 
analytically continue it to d = 4—e spacetime dimensions. (Here we ignore a 
subtlety with the definition of 75 in d dimensions, and assume that 75 = 1 
and 75/75 = — / continue to hold.) We also make the replacement g —> 
<7/i e / 2 , where g has dimensions of mass, so that g remains dimensionless. 
Expanding out the numerator, we have 


N = q 2 — x(l— x)k 2 + m 2 + (1—2 x)kq . 


(51.17) 


The term linear in q integrates to zero. For the rest, we use the general 
result of section 14 to get 


{2it) d (q 2 + D ) 2 167T 2 |_ e 


-HD/p 2 ) 


(51.18) 


+ 4 — In {D/p 2 ) (-2 D) , (51.19) 


' J (2n) d ( q 2 + D ) 2 ~ 16tt 2 U 2 v 'J v v ' 

where g 2 = 47re _7 /7 2 , and we have dropped terms of order e. Plugging 
eqs. (51.18) and (51.19) into eq. (51.16) yields 

n^ioo P (4’ 2 ) = -^3 7^ 2 + 2m2 ) + \k 2 + m 2 


J dx (^3x(l—x)k 2 + m 2 ^jln(D/g 2 ) . (51.20) 


We see that the divergent term has (as expected) a form that permits 
cancellation by the counterterms. 

We evaluated the second diagram of fig. (51.1) in section 31, with the 
result 

n^ioop (k 2 ) = 7 + 5-5 HM 2 /g 2 ) M 2 . (51.21) 
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>-■ > 

p p+i p p p 


Figure 51.2: The one- loop and counterterm corrections to the fermion prop- 
agator in Yukawa theory. 


The third diagram gives the contribution of the counterterms, 

n ct (/c 2 ) = -{Z^-l)k 2 - {Z m ~1)M 2 . (51.22) 

Adding up eqs. (51.20-51.22), we see that finiteness of n(/c 2 ) requires 


^ = ’ 


Zm = 1 + 


A 

167T 2 27T 2 




(51.23) 

(51.24) 


plus higher-order (in g and/or A) corrections. Note that, although there is 
an 0( A) correction to Zm, there is not an 0( A) correction to Z ip . 

We can impose II(— M 2 ) = 0 by writing 


n ' fc2 > - h 


IJo 


dx (3x(l—x)k 2 + m 2 )ln(D/D 0 ) + K^k 2 + M 2 


(51.25) 


where Dq = —x{l—x)M 2 + m 2 , and is a constant to be determined. We 
fix by imposing IT ( — M 2 ) = 0, which yields 


= / dx x{l— x)[3x(l— x)M 2 — m 2 ]/Do 


(51.26) 


Note that, in this on-shell renormalization scheme, there is no 0(A) correc- 
tion to U(k 2 ). 

Next we turn to the propagator, which receives the one-loop (and 
counterterm) corrections shown in fig. (51.2). The spin indices are con- 
tracted in the usual way, following the arrows backwards. We have 


*Siioop(iO = (-5) 2 (i) 


dH 

( 2' 7r ) 4 


75^(2/ + /)75 A(£ 2 ), 


(51.27) 


where S(j/>) is given by eq. (51.13), and 


A(£ 2 ) 


1 

t 2 + Af 2 - ie 


(51.28) 
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is the free scalar propagator in momentum space. 

We evaluate eq. (51.27) with the usual bag of tricks. The result is 


iZ’Uoopitf) = -g J ^ dx J ( 27 t )4 (g 2 + jpy ! (51.29) 

where q = t + xp and 

N = j + (1— x)q) + m , (51.30) 

D = x{l—x)p 2 + xm 2 + (1— x)M 2 . (51.31) 

The integral diverges, and so we analytically continue it to d = 4 — e 


spacetime dimensions, make the replacement g — » gp £ ^ 2 : and take the limit 
as £ -> 0. The term linear in q in eq. (51.30) integrates to zero. Using 
eq. (51.18), we get 

SiioopW = -^3 7 (^+ 2 \m)~ J dx ((l-x’)i/ + ?n)ln(T>/// 2 ) . (51.32) 

We see that the divergent term has (as expected) a form that permits 
cancellation by the counterterms, which give 

s ct(V) = -{Zy- \)i~ (Z m -l)m . (51.33) 

Adding up eqs. (51.32) and (51.33), we see that finiteness of £(^) requires 

= 1 - Q + finite ) > ( 5L34 ) 

Z m = 1 - Q + j (51.35) 

plus higher-order corrections. 

We can impose £(— m) = 0 by writing 

£($ = ^1^2 J q dx((l-x)tf + myn(D/D 0 ) + Ky(rf + m) , (51.36) 

where D 0 is D evaluated at p 2 = —m 2 , and Kqt is a constant to be deter- 
mined. We fix Kq> by imposing £'(— m) = 0. In differentiating with respect 
to we take the p 2 in D, eq. (51.31), to be —]j 2 \ we find 

Kq, = —2 [ dx x 2 (l— x)m 2 /Do . 

Jo 


(51.37) 
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Figure 51.3: The one-loop correction to the scalar-fermion-fermion vertex 
in Yukawa theory. 

Next we turn to the correction to the Yukawa vertex. We define the 
vertex function i V y (p ' , p) as the sum of one-particle irreducible diagrams 
with one incoming fermion with momentum p , one outgoing fermion with 
momentum p ' , and one incoming scalar with momentum k = p' — p. The 
original vertex — Z g gy§ is the first term in this sum, and the diagram of 
fig. (51.3) is the second. Thus we have 

iVy{p',p) = —ZgQTb + *Vy lloop {p',p) + 0(g 5 ) , (51.38) 

where 

/ \3 C d^P v ~ - i _ 

iV Y ,noo P (p,p) = (-s0 3 (j) J [ 75 *§(/+/) 75 -§(^+/) 75 ]A(^ 2 ) . 

(51.39) 

The numerator can be written as 

N = (/ + / + ra)(-^ - f- + m)y 5 , (51.40) 

and the denominators combined in the usual way. We then get 

/ P di^ 

dF 3 j (g2 + D y > ( 5L41 ) 

where the integral over Feynman parameters was defined in section 16, and 
now 

q = £ + x\p + xyp , (51.42) 

N = [j - Xirf + (1-X2 )/ + m\ [~i~ (l-xi)j^ + x 2 / + m]y 5 , (51.43) 

D = xi(l— xi)p 2 + x 2 (l— x 2 )p /2 — 2xi xyp-p 

+ (xi+x 2 )?n 2 + x z M 2 . (51.44) 

Using jj = —q 2 , we can write N as 

N = q 2 75 + N + (linear in q) , 


(51.45) 
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Figure 51.4: One of six diagrams with a closed fermion loop and four 
external scalar lines; the other five are obtained by permuting the external 
momenta in all possible inequivalent ways. 

where 

N = [—x\$ + (l-x 2 )/ + m] [- + x 2 / + m] 75 . (51.46) 

The terms linear in q in eq. (51.45) integrate to zero, and only the first term 
is divergent. Performing the usual manipulations, we find 

iV Y ,iioop{p' ,P) = ^2 \ ~ \ J dF ?> ln(D//i 2 ))75 + 1/^3 ^ . 

(51.47) 

From eq. (51.38), we see that finiteness of V y (jJ , p) requires 

Z g = 1 + ^-2 Q + finite^ , (51.48) 

plus higher-order corrections. 

To fix the finite part of Z g , we need a condition to impose on \y{p' ,p)- 
We will mimic what we did in ip 3 theory in section 16, and require Vy (0, 0) 
to have the tree-level value *< 775 . We leave the details to problem 51.2. 

Next we turn to the corrections to the ip 4 vertex iV^ki, £ 7 , £’4); the 
tree-level contribution is —iZ\ A. There are diagrams with a closed fermion 
loop, as shown in fig. (51.4), plus one-loop diagrams with p particles only 
that we evaluated in section 31. We have 

*V 4 ,*ioop = (-1 )H?) 4 (i) 4 / 

X -S'(/+^ 2 +^ 3 )75*5(/+^ 2 )75 
+ 5 permutations of (£ 2 , £3, £4) . (51.49) 


Again we can employ the standard methods; there are no unfamiliar as- 
pects. This being the case, let us concentrate on obtaining the divergent 
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part; this will give us enough information to calculate the one- loop contri- 
butions to the beta functions for g and A. 

To obtain the divergent part of eq. (51.49), it is sufficient to set ki = 
0. The term in the numerator that contributes to the divergent part is 
Tr (/7 5 ) 4 = A(£ 2 ) 2 , and the denominator is (l 2 + m 2 ) 4 . Then we find, after 
including identical contributions from the other five permutations of the 
external momenta, 


v 4) * loop = -~^2 (- + finite) . 


7 r" \ £ 


From section 31, we have 


V. 


3A n 


4 ^l° op - 167r2 l 


— b finite . 


Then, using 


(51.50) 


(51.51) 


V 4 — —Z\X + V 4) vfl loop + V 4iV ioop + • • • 
we see that finiteness of V 4 requires 

' 3A 3 5 4 \/l 


Z\ = 1 + 


I67 r 2 7t 2 A \£ 


+ finite 


(51.52) 


(51.53) 


plus higher-order corrections. 


Reference Notes 


A detailed derivation of the Lehmann-Kallen form of the fermion propaga- 
tor can be found in Itzykson & Zuber. 

Problems 

51.1) Derive the fermion- loop correction to the scalar propagator by work- 
ing through eq. (45.2), and show that it has an extra minus sign rel- 
ative to the case of a scalar loop. 

51.2) Finish the computation of Yy{p' ,p), imposing the condition 

Vy(0, 0) = i<775 . (51.54) 

51.3) Consider making ip a scalar rather than a pseudoscalar, so that the 
Yukawa interaction is £yuk = gp'L'L ■ In this case, renormalizability 
requires us to add a term 3 = ^Z K mp 3 , as well as term linear in p to 
cancel tadpoles. Find the one-loop contributions to the renormalizing 
Z factors for this theory in the MS scheme. 
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52 Beta Functions in Yukawa Theory 

Prerequisite: 28, 51 

In this section we will compute the beta functions for the Yukawa coupling 
g and the (f 4 coupling A in Yukawa theory, using the methods of section 28. 
The relations between the bare and renormalized couplings are 


g 0 = Z- l / 2 Z^ l Z g ~g £ / 2 g , 

(52.1) 

Ao = Z v 2 Z x ~g £ X . 

(52.2) 

Let us define 


= £ ALA , 

n= 1 

(52.3) 

Kv^) = t - 

71=1 

(52.4) 

From our results in section 51, we have 


5l7 2 

Gli9 ' X) ~ 16vr 2+ -- - ’ 

(52.5) 

T / w 3A i g 2 3g 4 , 

il(9 ' A) - 16ir 2 2ir 2 + •• • > 

(52.6) 


where the ellipses stand for higher-order (in g 2 and/or A) corrections. 

Taking the logarithm of eqs. (52.1) and (52.2), and using eqs. (52.3) and 
(52.4), we get 


In go = Y T ln S' + 2 ell W , 

n= l e 

In Ao = ^ ^ — - + In A + e In g . 

n=l 6 


(52.7) 

(52.8) 


We now use the fact that go and Ao must be independent of g. We differen- 
tiate eqs. (52.7) and (52.8) with respect to In//; the left-hand sides vanish, 
and we multiply the right-hand sides by g and A, respectively. The result 
is 

f 9G n dg d G n rfA \ 1 dg , 

0 = \ 9 + + ’ ( 52 ’ 9 ) 


dg ding d\ ding 

0 = f (a/e * + a/A JY )l + JY +eX . (52.10) 

_ V dg d In g dX d In g J e n d In g 
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In a renormalizable theory, dg/dlnp and dX/dlnp must be finite in the 
e — > 0 limit. Thus we can write 

= -\eg + p g (g, A) , (52.11) 

-^- = -eX + (3 x {g,X). (52.12) 

Substituting these into eqs. (52.9) and (52.10), and matching powers of e, 
we find 

Pg(9, A) = g(\g-r^ + A^ , (52.13) 

&(S,A) = A (^ + A^T! . (52.14) 

The coefficients of all higher powers of 1/e must also vanish, but this gives 
us no more information about the beta functions. 

Using eqs. (52.5) and (52.6) in eqs. (52.13) and (52.14), we get 

5r; 3 

A g{d, A) = + • • • > (52.15) 

P\(g, A) = ^(3A 2 + 8A g 2 - 48 g 4 ) + . . . . (52.16) 

The higher-order corrections have extra factors of g 2 and/or A. 

Problems 

52.1) Compute the one-loop contributions to the anomalous dimensions of 
m, M, \H, and ip. 

52.2) Consider the theory of problem 51.3. Compute the one-loop contri- 
butions to the beta functions for g , A, and k, and to the anomalous 
dimensions of m, M, T, and ip. 

52.3) Consider the beta functions of eqs. (52.15) and (52.16). 

a) Let p = A /g 2 , and compute dp/ din p. Express your answer in 
terms of g and p. Explain why it is better to work with g and p 
rather than g and A. Hint: the answer is mathematical, not physical. 

b) Show that there are two fixed points , p* j_ and p*_, where dp/ din p = 
0, and find their values. 

c) Suppose that, for some particular value of the renormalization scale 
p, we have p = 0 and g 1. What happens to p at much higher 
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values of p (but still low enough to keep j < 1)? At much lower 
values of //? 

d) Same question, but with an initial value of p = 5. 

e) Same question, but with an initial value of p = —5. 

f) Find the trajectory in the (p, g) plane that is followed for each 
of the three starting points as p is varied up and down. Hint: you 
should find that the trajectories take the form 


9 = 9 o 


P~ P*+ 
P~ pr- 


ior some particular exponent v. Put arrows on the trajectories that 
point in the direction of increasing p. 

g) Explain why p*_ is called an ultraviolet stable fixed point, and why 
p* + is called an infrared stable fixed point. 
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53 Functional Determinants 

Prerequisite: 44, 45 

In the section we will explore the meaning of the functional determinants 
that arise when doing gaussian path integrals, either bosonic or fermionic. 
We will be interested in situations where the path integral over one partic- 
ular field is gaussian, but generates a functional determinant that depends 
on some other field. We will see how to relate this functional determinant 
to a certain infinite set of Feynman diagrams. We will need the technology 
we develop here to compute the path integral for nonabelian gauge theory 
in section 70. 

We begin by considering a theory of a complex scalar field \ with 

C = -d^x^dpx ~ + 9FX ] X , (53.1) 

where ip is a real scalar background field. That is, (p(x) is treated as a fixed 
function of spacetime. Next we define the path integral 

%) = |p X t P X e 1 / A£ , (53.2) 

where we use the e trick of section 6 to impose vacuum boundary conditions, 
and the normalization Z( 0) = 1 is fixed by hand. 

Recall from section 44 that if we have n complex variables Zi, then we 
can evaluate gaussian integrals by the general formula 

J d n zd n z exp (—iziMijZj) oc (detAf) -1 . (53.3) 

In the case of the functional integral in eq. (53.2), the index i on the inte- 
gration variable is replaced by the continuous spacetime label x. and the 
“matrix” M becomes 

M(x,y ) = [—d 2 + m 2 — gip(x)]S 4 (x — y ) . (53.4) 

In order to apply eq. (53.3), we have to understand what it means to com- 
pute the determinant of this expression. 

To this end, let us first note that we can write M = MqM, which is 
shorthand for 

M(x,z ) = j d 4 yM 0 (x,y)M(y,z ) , (53.5) 

where 

M 0 (x, y) = {-dl + m 2 )S 4 (x - y) , (53.6) 

M(y, z) = 5 4 (y - z) - gA(y - z)cp(z) . (53.7) 
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Here A (y — z) is the Feynman propagator, which obeys 

(~dy +m 2 )A(y - z) = 5 4 {y - z) . (53.8) 

After various integrations by parts, it is easy to see that eqs. (53.5-53.7) 
reproduce eq. (53.4). 

Now we can use the general matrix relation det AB = det A det B to 
conclude that 

detM = det Mo detM . (53.9) 

The advantage of this decomposition is that M$ is independent of the 
background field ip, and so the resulting factor of (det Mo) -1 in Z(ip) can 
simply be absorbed into the overall normalization. Furthermore, we have 
M = I — G, where 

I{x,y) = S 4 (x - y) (53.10) 

is the identity matrix, and 

G(x, y ) = gA(x - y)<p(y) . (53.11) 

Thus, for ip{x ) = 0, we have M = I and so det M = 1. Then, using 
eq. (53.3) and the normalization condition Z( 0) = 1, we see that for nonzero 
ip(x) we must have simply 

Z(<p) = (detM)- 1 . (53.12) 


Next, we need the general matrix relation det A = exp Tr In A. which is 
most easily proved by working in a basis where A is in Jordan form (that 
is, all entries below the main diagonal are zero). Thus we can write 


detM = expTHnM 


= exp Tr ln(/ — G) 



Combining eqs. (53.12) and (53.13) we get 


OO 

Z(<P) = exp^ — Tr G n , 

n= 1 


(53.13) 


(53.14) 


where 

Tr G n = g n J d 4 xi . . . d 4 x n A(xi— a^V^) • • • A(x n — x\)ip(xi) . (53.15) 



53: Functional Determinants 


328 



Figure 53.1: All connected diagrams with <p(x) treated as an external field. 
Each of the n dots represents a factor of igtp(x), and each solid line is a x 
or T propagator. 

This is our final result for Z(cp). 

To better understand what it means, we will rederive it in a different 
way. Consider treating the g<px'X term in C as an interaction. This leads 
to a vertex that connects two x propagators; the associated vertex factor 
is ig(p(x). According to the general analysis of section 9, we have Z{ip) = 
exp ?T (ip), where ?T(<^) is given by a sum of connected diagrams. (We have 
called the exponent T rather than W because it is naturally interpreted as a 
quantum action for cp after x has been integrated out.) The only connected 
diagrams we can draw with these Feynman rules are those of fig. (53.1), 
with n insertions of the vertex, where n > 1. The diagram with n vertices 
has an n-fold cyclic symmetry, leading to a symmetry factor of S = n. 
The factor of i associated with each vertex is canceled by the factor of 1 /i 
associated with each propagator. Thus the value of the n- vertex diagram 
is 

— g n [ d 4 xi . . . d A x n A(xi-x 2 )^(x 2 ) . . . A(x n —xi)p>(xi) . (53.16) 

n J 

Summing up these diagrams, and using eq. (53.15), we find 

OO 

iT(<p) = ^ -TrG n . (53.17) 

. Tl 
n= 1 

This neatly reproduces eq. (53.14). Thus we see that a functional determi- 
nant can be represented as an infinite sum of Feynman diagrams. 

Next we consider a theory of a Dirac fermion 4/ with 

C = iF(j)F - rnTT + gtpFF , (53.18) 

where ip is again a real scalar background field. We define the path integral 

Z(tp) = J VFVF e i f d4xC , 


(53.19) 
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where we again use the e trick to impose vacuum boundary conditions, and 
the normalization Z( 0) = 1 is fixed by hand. 

Recall from section 44 that if we have n complex Grassmann variables 
'ipi, then we can evaluate gaussian integrals by the general formula 

J d^dJhp exp (—i'tpiMij'i/jj) oc det M . (53.20) 

In the case of the functional integral in eq. (53.19), the index i on the 
integration variable is replaced by the continuous spacetime label x plus 
the spinor index a, and the “matrix” M becomes 

M a p{x, y) = \-i<j) x + m- gip(x)} af3 5 A (x - y ) . (53.21) 

In order to apply eq. (53.20), we have to understand what it means to 
compute the determinant of this expression. 

To this end, let us first note that we can write M = MqM, which is 
shorthand for 

M ai {x,z ) = J d 4 y M 0al3 (x,y)M l3l (y, z) , (53.22) 

where 

M 0a p(x, y) = + m) aj3 5 A {x - y) , (53.23) 

Mfa (y, z) = 5f3 1 5 A {y - z) - gS^{y - z)^{z) . (53.24) 

Here Sg^(y — z) is the Feynman propagator, which obeys 

{~i$ y + ™) a pSfa(y - z) = 5 ai 5 A (y - z) . (53.25) 

After various integrations by parts, it is easy to see that eqs. (53.22-53.24) 
reproduce eq. (53.21). 

Now we can use eq. (53.9). The advantage of this decomposition is that 
Mo is independent of the background held ip, and so the resulting factor 
of det Mo in Z(cp) can simply be absorbed into the overall normalization. 
Furthermore, we have M = I — G, where 

I a p(x, y) = d a g5 A (x - y) (53.26) 

is the identity matrix, and 

G a g(x, y) = gS a p(x - y)ip(y) . (53.27) 

Thus, for ip{x) = 0, we have M = I and so det M = 1. Then, using 
eq. (53.20) and the normalization condition Z( 0) = 1, we see that for 
nonzero ip{x) we must have simply 

Z{tp) = det M . 


(53.28) 
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Next, we use eqs. (53.13) and (53.28) to get 


OO ^ 

Z(<p) = exp — — Tr G n , (53.29) 

“ n 

n — 1 


where now 

Tr G n = g n J d 4 x\ . . . d A x n tr S(x\ — X 2 )<f>(x 2 ) ■ ■ ■ S(x n — xi)(p(x\) , (53.30) 


and “tr” denotes a trace over spinor indices. This is our final result for 
Z{<P)- 

To better understand what it means, we will rederive it in a different 
way. Consider treating the y.p'FF term in C as an interaction. This leads 
to a vertex that connects two T propagators; the associated vertex factor 
is igtp(x). According to the general analysis of section 9, we have Z(ip) = 
exp *r (y>), where *T(<^) is given by a sum of connected diagrams. (We have 
called the exponent T rather than W because it is naturally interpreted 
as a quantum action for <p after has been integrated out.) The only 
connected diagrams we can draw with these Feynman rules are those of 
fig. (53.1), with n insertions of the vertex, where n > 1. The diagram with 
n vertices has an n-fold cyclic symmetry, leading to a symmetry factor of 
S = n. The factor of i associated with each vertex is canceled by the factor 
of 1 /i associated with each propagator. The closed fermion loop implies a 
trace over the spinor indices. Thus the value of the n - vertex diagram is 



n 


d A x i . . . d A x n tr S(x\— 2:2 V(® 2 ) • • • S(x n —xi)ip(xi) . 


(53.31) 


Summing up these diagrams, we find that we are missing the overall minus 
sign in eq. (53.29). The appropriate conclusion is that we must associate 
an extra minus sign with each closed fermion loop. 



Part III 

Spin One 
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54 Maxwell’s Equations 

Prerequisite: 3 

The most common (and important) spin-one particle is the photon. Emis- 
sion and absorption of photons by matter is an important phenomenon in 
many areas of physics, and so that is the context in which most physicists 
first encounter a serious treatment of photons. We will use a brief review 
of this subject (in this section and the next) as our entry point into the 
theory of quantum electrodynamics. 

Let us begin with classical electrodynamics. Maxwell’s equations are 


V-E = p , (54.1) 

VxB-E = J, (54.2) 

V x E + B = 0 , (54.3) 

V-B = 0 , (54.4) 


where E is the electric field, B is the magnetic field, p is the charge density, 
and J is the current density. We have written Maxwell’s equations in 
Heaviside- Lorentz units , and also set c = 1. In these units, the magnitude 
of the force between two charges of magnitude Q is Q 2 /4nr 2 . 

Maxwell’s equations must be supplemented by formulae that give us 
the dynamics of the charges and currents (such as the Lorentz force law for 
point particles). For now, however, we will treat the charges and currents as 
specified sources, and focus on the dynamics of the electromagnetic fields. 

The last two of Maxwell’s equations, the ones with no sources on the 
right-hand side, can be solved by writing the E and B fields in terms of a 
scalar potential <p and a vector potential A, 

E = -Vip - A , (54.5) 

B = V x A . (54.6) 

The potentials uniquely determine the fields, but the fields do not uniquely 
determine the potentials. Given a particular <p and A that result in a 
particular E and B, we will get the same E and B from any other potentials 
ip' and A' that are related by 

ip' = p + T , (54.7) 

A 7 = A - vr , (54.8) 

where T is an arbitrary function of spacetime. A change of potentials that 
does not change the fields is called a gauge transformation. The E and B 
fields are gauge invariant. 
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All this becomes more compact and elegant in a relativistic notation. 
We define the four-vector potential or gauge field 


A» = {<p, A). 

(54.9) 

We also define the field strength 


F pv = d p A u - d v A p . 

(54.10) 

Obviously, F^ w is antisymmetric: F ^ = —F Ufl . 
(54.6) with eqs. (54.9) and (54.10), we see that 

Comparing eqs. (54.5) and 

'Eq 

II 

O 

(54.11) 

F ij = e ijk B k . 

(54.12) 

The first two of Maxwell’s equations can now be written as 

d v F pv = J p , 

(54.13) 

where 

J^ = (P, J) 

(54.14) 

is the charge-current density four-vector. 

If we take the four-divergence of eq. (54.13), 

, we get d p d, / F^ W = d p J^ 1 . 


The left-hand side of this equation vanishes, because d p d v is symmetric on 
exchange of p and v, while F 1 ' 1 ' is antisymmetric. We conclude that we 
must have 

d,,,!" = 0 , (54.15) 

or equivalently 

p + V- J = 0 ; (54.16) 

that is, the electromagnetic current must be conserved. 

The last two of Maxwell’s equations can be written as 

e^ P ad p F^ = 0 , (54.17) 

where £ pupa is the completely antisymmetric Levi-Civita tensor; see section 
34. Plugging in eq. (54.10), we see that eq. (54.17) is automatically satisfied, 
since the antisymmetric combination of two derivatives vanishes. 

Eqs. (54.7) and (54.8) can be combined into 

A ,p = A 11 - d p T . (54.18) 

Setting F'l w = d^A ,u — d'A' 11 and using eq. (54.18), we get 


p gw = ppv _ _ qvq^y . 


(54.19) 
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The last term vanishes because derivatives commute; thus the field strength 
is gauge invariant, 

F' IW = F' JV . (54.20) 

Next we will find an action that results in Maxwell’s equations as the 
equations of motion. We will treat the current as an external source. The 
action we seek should be Lorentz invariant, gauge invariant, parity and 
time-reversal invariant, and no more than second order in derivatives. The 
only candidate is S = f d 4 x£, where 

C = -\F» V F„ V + .FA,, . (54.21) 

The first term is obviously gauge invariant, because F^ u is. After a gauge 
transformation, eq. (54.18), the second term becomes <FA'^, and the differ- 
ence is 


j^-aj = - J^r 

= -(c^)r-cVGTT) . (54.22) 

The first term in eq. (54.22) vanishes because the current is conserved. The 
second term is a total divergence, and its integral over d A x vanishes (assum- 
ing suitable boundary conditions at infinity). Thus the action specified by 
eq. (54.21) is gauge invariant. 

Setting F^ u = d^A u — d'A IJ/ and multiplying out the terms, eq. (54.21) 
becomes 

C = + \dFA v d v A Vi + .FA,, (54.23) 

= +\A^d 2 - d ll d v )A v + .FA,, - &‘K„ , (54.24) 

where K, t = ^A v {jd^A v —d v A^). The last term is a total divergence, and can 
be dropped. From eq. (54.24), we can see that varying A ^ while requiring 
S to be unchanged yields the equation of motion 

{g^d 2 - d l ‘d v )A l , + J» = 0 . (54.25) 

Noting that dyF^ = d I ,(d l A 1 ' — d v A 11 ) = (d ,Ji d v — g flu d 2 )A u , we see that 
eq. (54.25) is equivalent to eq. (54.13), and hence to Maxwell’s equations. 
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55 Electrodynamics in Coulomb Gauge 

Prerequisite: 54 

Next we would like to construct the hamiltonian, and quantize the electro- 
magnetic held. 

There is an immediate difficulty, caused by the gauge invariance: we 
have too many degrees of freedom. This problem manifests itself in several 
ways. For example, the lagrangian 

£ = —\F tJ,v F t]V + J^A/j (55.1) 

= + \& l A v d v A ll + (55.2) 

does not contain the time derivative of A 0 . Thus, this held has no canoni- 
cally conjugate momentum and no dynamics. 

To deal with this problem, we must eliminate the gauge freedom. We 
do this by choosing a gauge. We choose a gauge by imposing a gauge 
condition. This is a condition that we require A fl (x) to satisfy. The idea 
is that there should be only one A^{x) that results in a given F^ v {x) and 
that also satishes the gauge condition. 

One possible class of gauge conditions is nE4 /i (x) = 0, where nJ is a 
constant four- vector. If n is spacelike (n 2 > 0), then we have chosen axial 
gauge ; if n is lightlike, (n 2 = 0), it is lightcone gauge ; and if n is timelike, 
(n 2 < 0), it is temporal gauge. 

Another gauge is Lorenz gauge , where the condition is J'd,, = 0. We 
will meet a family of closely related gauges in section 62. 

In this section, we will work in Coulomb gauge , also known as radiation 
gauge or transverse gauge. The condition for Coulomb gauge is 

V-A(x) = 0. (55.3) 

We can impose eq. (55.3) by acting on Ai{x) with a projection operator, 

Ai{x) -»• ^ - ^^ S jA j (x) . (55.4) 

We construct the right-hand side of eq. (55.4) by Fourier-transforming Aj(x) 
to Ai(k), multiplying Aj(k) by the matrix 5ij k ikj/ k 2 , and then Fourier- 
transforming back to position space. From now on, whenever we write A,;, 
we will implicitly mean the right-hand side of eq. (55.4). 

Now let us write out the lagrangian in terms of the scalar and vector 
potentials, g> = A 0 and A;, with A* obeying the Coulomb gauge condition. 
Starting from eq. (55.2), we get 

C = |AjAj — ^VjAjVjAj + JjA* 
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+ ^VjAjVj-Aj + AV \<p 

+ — pp . (55.5) 

In the second line of eq. (55.5), the V* in each term can be integrated by 
parts; in the first term, we will then get a factor of Vj(VjA)> and in the 
second term, we will get a factor of V*A- Both of these vanish by virtue 
of the gauge condition V*A = 0, and so both of these terms can simply be 
dropped. 

If we now vary p (and require S = / d A x L to be stationary), we find 
that ip obeys Poisson’s equation, 

— V 2 <y? = p . (55.6) 


The solution is 

^• l) = I S,J ryAy ■ <557) 

This solution is unique if we impose the boundary conditions that <p and p 
both vanish at spatial infinity. 

Eq. (55.7) tells us that p(x, t ) is given entirely in terms of the charge 
density at the same time, and so has no dynamics of its own. It is therefore 
legitimate to plug eq. (55.7) back into the lagrangian. After an integration 
by parts to turn VipVip into —pV 2 p = pp, the result is 

C = ^AA — A + Ji-A-i + £ cou i , (55.8) 


where 


Tcoul 


1 f d 3 p(x,t)p(y,t) 

2 J ^ 47r|x— y| 


(55.9) 


We can now vary A; keeping proper track of the implicit projection opera- 
tor in eq. (55.4), we find that A obeys the massless Klein-Gordon equation 
with the projected current as a source, 


- d 2 A.i(x ) = (Sij - x) . 


(55.10) 


For a free field (./j = 0), the general solution is 


A(x) 


E 

A=± 


dk 


4(k)a A (k)e 


ikx 


e A (k)4(k)e-H, 


(55.11) 


where k° = lu = |k|, dk = dA/(27r) 3 2u;, and e+(k) and e_(k) are po- 
larization vectors. In order to satisfy the Coulomb gauge condition, the 
polarization vectors must be orthogonal to the wave vector k. We will 
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choose them to correspond to right- and left-handed circular polarizations; 
for k = (0, 0, k), we then have 

<M k ) = , 


M k ) = 73 ( 1 , +*, 0 ) . 

(55.12) 

More generally, the two polarization vectors along with the unit vector in 
the k direction form an orthonormal and complete set, 

k-£A(k) = 0 , 

(55.13) 

£ A'(k)-e A (k) = 5 X '\ , 

(55.14) 

X £ a(k) £ !A(k) = Sij / . 

A=± K 

(55.15) 

The coefficients a A (k) and a^(k) will become operators after quantization, 
which is why we have used the dagger symbol for conjugation. 

In complete analogy with the procedure used for a scalar field in section 
3, we can invert eq. (55.11) and its time derivative to get 

a A (k) = +ie x {k)- j d 3 x e lkx d 0 A(x ) , 

(55.16) 

at^(k) = — ie A (k )- J d 3 x e +lkx 8 qA(x) , 

(55.17) 

where fd^g = f{d^g) - (d^f)g. 

Now we can proceed to the hamiltonian formalism. First, 
the canonically conjugate momentum to A,, 

we compute 

IT, = dC = Aj . 
dA t 

(55.18) 


Note that VjAj = 0 implies Vjllj = 0. The hamiltonian density is then 

n = n iAi - c 

= ± 11 * 11 * + — JiAi + Hcxmi ■> ( 55 . 19 ) 


where TC CO ui = -£ CO ui- 

To quantize the field, we impose the canonical commutation relations. 
Keeping proper track of the implicit projection operator in eq. (55.4), we 
have 

[Aj(x, t),IIj(y,t)] = i^Sij - ^J±^ 3 (x- y) 


= i 


d3k ik-(x— y) 

( 2 tt )3 




(55.20) 
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The commutation relations of the a A (k) and at (k) operators follow from 
eq. (55.20) and [Aj, Aj] = [II j, II j] = 0 (at equal times). The result is 


[a A (k),a A /(k')] = 0 , 

(55.21) 

[° A ( k )> a A /( k/ )] = o , 

(55.22) 

[a A (k),a^,(k')] = (27r) 3 2w 6 3 (k' - k)<5 AA / . 

(55.23) 


We interpret at^(k) and a A (k) as creation and annihilation operators for 
photons of definite helicity, with helicity +1 corresponding to right-circular 
polarization and helicity — 1 to left-circular polarization. 

It is now straightfoward to write the hamiltonian explicitly in terms of 
these operators. We find 

*=£ / dk u a\(k)a\{k) + 2 £qV — f d 3 x 3(x)- A(x) + H cou \ , (55.24) 

A=± J ^ 

where So = ^(27 r) -3 / d 3 k ui is the zero-point energy per unit volume that 
we found for a real scalar field in section 3, V is the volume of space, the 
Coulomb hamiltonian is 

f My , (55.25) 

2 J 47r|x-y| 

and we use eq. (55.11) to express A t (x) in terms of a A (k) and a^(k) at any 
one particular time (say, t = 0). This is sufficient, because H itself is time 
independent. 

This form of the hamiltonian of electrodynamics is often used as the 
starting point for calculations of atomic transition rates, with the charges 
and currents treated via the nonrelativistic Schrodinger equation. The 
Coulomb interaction appears explicitly, and the J-A term allows for the 
creation and annihilation of photons of definite polarization. 

Reference Notes 

A more rigorous treatment of quantization in Coulomb gauge can be found 
in Weinberg I. 


Problems 

55.1) Use eqs. (55.13-55.20) and [A*, Aj\ = [n^IL,-] = 0 (at equal times) to 
verify eqs. (55.21-55.23). 

55.2) Use eqs. (55.11), (55.14), (55.19), and (55.21-55.23) to verify eq. (55.24). 
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56 LSZ Reduction for Photons 

Prerequisite: 5, 55 

In section 55, we found that the creation and annihilation operators for free 
photons could be written as 

a A (k) = — ie A (k)- J d 3 x e +lkx A(x) , (56.1) 

a A (k) = +i£A(k)- J d 3 x e~ lkx do A(x) , (56.2) 

where e A (k) is a polarization vector. From here, we can follow the analysis 
of section 5 line by line to deduce the LSZ reduction formula for photons. 
The result is that the creation operator for each incoming photon should 
be replaced by 

a{(k) in ->is£*(k) j d 4 x e +lkx (-d 2 )A^(x ) , (56.3) 

and the destruction operator for each outgoing photon should be replaced 

by 

«A(k)out ->• ^A(k) j d 4 x e~ lkx (-d 2 )A^(x) , (56.4) 

and then we should take the vacuum expectation value of the time-ordered 
product. Note that, in writing eqs. (56.3) and (56.4), we have made them 
look nicer by introducing e^(k) = 0, and then using four-vector dot prod- 
ucts rather than three-vector dot products. 

The LSZ formula is valid provided the field is normalized according to 
the free-held formulae 


(0|yT(.x)|0) = 0 , (56.5) 

(MYOPIC) =e\(k)e ikx , (56.6) 

where | Ai, A) is a single photon state, normalized according to 

(P, X'\k, A) = (2tt) 3 2 uj <5 3 (k' - k)d AV . (56.7) 

The zero on the right-hand side of eq. (56.5) is required by rotation invari- 
ance, and only the overall scale of the right-hand side of eq. (56.6) might 
be different in an interacting theory. 

The renormalization of A t necessitates including appropriate Z factors 
in the lagrangian, 


C = + Z\ J'Ajj . 


(56.8) 
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Here and Z\ are the traditional names; we will meet Z2 in section 62. 
We must choose Z3 so that eq. (56.6) holds. We will fix Z\ by requiring the 
corresponding vertex function to take on a certain value for a particular set 
of external momenta. 

Next we must compute the correlation functions (0|TAj(x) . . . |0). As 
usual, we begin by working with free field theory. The analysis is again 
almost identical to the case of a scalar field; see problem 8.4. We find that, 
in free field theory, 


<0|TA’(x)A^)|0) = iA«(:c - y ) , (56.9) 

where the propagator is 

r d 4 k p ik ( x ~y ) 

~y) = j W) 4 -JfT— £A= ± 4*(k)4(k) • (56.10) 

As with a free scalar field, correlations of an odd number of fields van- 
ish, and correlations of an even number of fields are given in terms of the 
propagator by Wick’s theorem; see section 8. 

We would now like to evaluate the path integral for the free electromag- 
netic field 

Z 0 (J) = (0|0)j = j . (56.11) 


Here we treat the current J^ L {x) as an external source. 

We will evaluate Zq(J) in Coulomb gauge. This means that we will 
integrate over only those field configurations that satisfy V-A = 0. 

We begin by integrating over A 0 . Because the action is quadratic in 
A 11 , this is equivalent to solving the variational equation for A 0 , and then 
substituting the solution back into the lagrangian. The result is that we 
have the Coulomb term in the action, 

Seoul = [ d 4 ‘ x d 4 y 5(x°-y°) J ^ X ) J y • (56.12) 

2 J 47r|x— y| 

Since this term does not depend on the vector potential, we simply get a 
factor of exp (?' Seoul) in front of the remaining path integral over A,. We wll 
perform this integral formally (as we did for fermion fields in section 43) 
by requiring it to yield the correct results for the correlation functions of 
A* when we take functional derivatives with respect to J;. In this way we 
find that 


Z 0 (J) = exp 


^S cou i T- ^ 


d A x d A y Ji{x) A lJ (x - y)Jj{y ) . 


(56.13) 
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We can make Zq(J) look prettier by writing it as 


Z 0 (J) = exp 
where we have defined 
A^(x-y) = 


d 4 xd 4 y J M (x) A^(x - y)J v (y ) 


d A k 

Wf 


e ik(x-y) /^(fc) ) 


(56.14) 


(56.15) 


A '“'(*)= + (56.16) 


The first term on the right-hand side of eq. (56.16) reproduces the Coulomb 
term in eq. (56.13) by virtue of the facts that 


j+oo ^_. k0{x0 _ y0) = ^ ^ 

r d 3 k e ik '( x -y) l 


(2vr)2 


k 2 


47t|x - y| 


(56.17) 

(56.18) 


The second term on the right-hand side of eq. (56.16) reproduces the second 
term in eq. (56.13) by virtue of the fact that e°(k) = 0. 

Next we will simplify eq. (56.16). We begin by introducing a unit vector 
in the time direction, 

t> 1 = (1,0) . (56.19) 

Next we need a unit vector in the k direction, which we will call z ,L . We 
first note that t-k = — k°, and so we can write 

(0,k) = k» + {i-k)i» . (56.20) 

The square of this four-vector is 

k 2 = k 2 + {t-k) 2 , (56.21) 


where we have used t 2 = —1. Thus the unit vector that we want is 


^ W + (t-fc)fr 
[. k 2 + {t-k) 2 ] 1 / 2 

Now we recall from section 55 that 

E 4*(k)4(k) = S tj - ■ 

A=± 


k 2 


(56.22) 


(56.23) 
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This can be extended to i — » g and j — > v by writing 

E ^r( k )4(k) = <T + W - z»z v . (56.24) 

A=± 


It is not hard to check that the right-hand side of eq. (56.24) vanishes if 
fi = 0 or u = 0, and agrees with eq. (56.23) for g = i and u = j. Putting 
all this together, we can now write eq. (56.16) as 


A ^(k) 


tn v g yv + tn v - z y z v 

k 2 + (t-k) 2 + k 2 - ie 


(56.25) 


The next step is to consider the terms in this expression that contain 
factors of k y or AT; from eq. (56.22), we see that these will arise from the 
z^z" term. In eq. (56.15), a factor of k y can be written as a derivative with 
respect to aT acting on e lk ^ x ~ y \ This derivative can then be integrated 
by parts in eq. (56.14) to give a factor of d y J fl (x). But d M J M (x) vanishes, 
because the current must be conserved. Similarly, a factor of k u can be 
turned into d u J u (y), and also leads to a vanishing contribution. Therefore, 
we can ignore any terms in A flL '(k) that contain factors of k y or k v . 

From eq. (56.22), we see that this means we can make the substitution 


( t-k)t y 

[ k 2 + (t-k) 2 } 1 / 2 


(56.26) 


Then eq. (56.25) becomes 


t y t v , 
(56.27) 

where the three coefficients of t ll t l/ come from the Coulomb term, the t y t u 
term in the polarization sum, and the z y, z u term, respectively. A bit of 
algebra now reveals that the net coefficient of tJ J t u vanishes, leaving us 
with the elegant expression 

A ^(k) = . (56.28) 

Ar — ie 


A ^(k) = 


k 2 — ie 


<r + - 


k 2 


k 2 + (t-k) 2 


+ 1 - 


(t-kf 


k 2 + (t-k) 2 


Written in this way, the photon propagator is said to be in Feynman gauge. 
(It would still be in Coulomb gauge if we had retained the AT and AT terms 
that we previously dropped.) 

In the next section, we will rederive eq. (56.28) from a more explicit 
path-integral point of view. 


Problems 


56.1) Use eqs. (55.11) and (55.21-55.23) to verify eqs. (56.9-56.10). 
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57 The Path Integral for Photons 

Prerequisite: 8, 56 

In this section, in order to get a better understanding of the photon path 
integral, we will evaluate it directly, using the methods of section 8. We 
begin with 

Z 0 (J) = fvA e iSo , (57.1) 

So = J d 4 x [-{F^F^ + TMj . (57.2) 

Following section 8, we Fourier-transform to momentum space, where we 
find 

So = lI (^ [-Mk)(k?gi"-kW)A v (-k) 

+ J^(k)A^-k) + J^-QA^k )] . (57.3) 

The next step is to shift the integration variable A so as to “complete the 
square”. This involves inverting the 4x4 matrix k 2 g ,ll/ — k ,l k 1 ' . However, 
this matrix has a zero eigenvalue, and cannot be inverted. 

To see this, let us write 

k 2 g^ - Wk v = k 2 P^{k) , (57.4) 

where we have defined 

P^(k) = cT ~ k^kP/k 2 . (57.5) 

This is a projection matrix because, as is easily checked, 

P^ w (k)P u x {k) = P^ x (k) . (57.6) 

Thus the only allowed eigenvalues of P are zero and one. There is at least 
one zero eigenvalue, because 

P^ w (k)k u = 0 . (57.7) 

On the other hand, the sum of the eigenvalues is given by the trace 

g^ w (k) = 3 . (57.8) 


Thus the remaining three eigenvalues must all be one. 
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Now let us imagine carrying out the path integral of eq. (57.1), with 
Sq given by eq. (57.3). Let us decompose the field A fl (k) into components 
aligned along a set of linearly independent four- vectors, one of which is k fl . 
(It will not matter whether or not this basis set is orthonormal.) Because 
the term quadratic in A ^ involves the matrix k 2 P flu (k), and P ,iu (k)k l/ = 0, 
the component of A^(k) that lies along /c /t does not contribute to this 
quadratic term. Furthermore, it does not contribute to the linear term 
either, because d^J^x) = 0 implies k^ l J^{k) = 0. Thus this component 
does not appear in the path integal at all! It then makes no sense to 
integrate over it. We therefore define / VA to mean integration over only 
those components that are spanned by the remaining three basis vectors, 
and therefore satisfy k^A^ik) = 0. This is equivalent to imposing Lorenz 
gauge, d^A^x) = 0. 

The matrix P fiu (k) is simply the matrix that projects a four- vector into 
the subspace orthogonal to kJ 1 . Within the subspace, P^ u (k) is equiva- 
lent to the identity matrix. Therefore, within the subspace, the inverse 
of k 2 P^ ll (k ) is (l/k 2 )P* lv (k). Employing the e trick to pick out vacuum 
boundary conditions replaces k 2 with k 2 — ie. 

We can now continue following the procedure of section 8, with the 
result that 


Zq{J) = exp 


= exp 


where 


f ^ J (k) P " V{k) J ( Ml 
' (2tt) 4 M } k 2 -ie M ’ 


^ d A x d A y J M ( x)A ttv {x - y)J„{y ) 

, (57.9) 

f d A k ik(x _ v) P^(k) 
J (27t) 4 k 2 — ie 

(57.10) 


is the photon propagator in Lorenz gauge (also known as Landau gauge). 
Of course, because the current is conserved, the k^k u term in P^ u (k) does 
not contribute, and so the result is equivalent to that of Feynman gauge, 
where P^ LU {k) is replaced by g tlv . 
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58 Spinor Electrodynamics 

Prerequisite: 45, 57 

In the section, we will study spinor electrodynamics : the theory of photons 
interacting with the electrons and positrons of a Dirac field. (We will 
use the term quantum electrodynamics to denote any theory of photons, 
irrespective of the kinds of particles with which they interact.) 

We construct spinor electrodynamics by taking the electromagnetic cur- 
rent j ,L (x) to be proportional to the Noether current corresponding to the 
U(l) symmetry of a Dirac field; see section 36. Specifically, 

j^(x) = (x)Y l ^t (x) . (58.1) 

Here e = —0.302822 is the charge of the electron in Heaviside-Lorentz 
units, with % = c = 1. (We will rely on context to distinguish this e 
from the base of natural logarithms.) In these units, the fine-structure 
constant is a = e 2 /47r = 1/137.036. With the normalization of eq. (58.1), 
Q = / d 3 xj°(x ) is the electric charge operator. 

Of course, when we specify a number in quantum field theory, we must 
always have a renormalization scheme in mind; e = —0.302822 corresponds 
to a specific version of on-shell renormalization that we will explore in 
sections 62 and 63. The value of e is different in other renormalization 
schemes, such as MS, as we will see in section 66. 

The complete lagrangian of our theory is thus 

C = - \F IIV F IW + - mW + . (58.2) 

In this section, we will be concerned with tree-level processes only, and so 
we omit renormalizing Z factors. 

We have a problem, though. A Noether current is conserved only when 
the fields obey the equations of motion, or, equivalently, only at points 
in field space where the action is stationary. On the other hand, in our 
development of photon path integrals in sections 56 and 57, we assumed 
that the current was always conserved. 

This issue is resolved by enlarging the definition of a gauge transfor- 
mation to include a transformation on the Dirac field as well as the elec- 
tromagnetic field. Specifically, we define a gauge transformation to consist 
of 


M\x) - 

-> A»{x) - ^T(x) , 

(58.3) 

T(x) - 

exp[— ieT{x)]^{x) , 

(58.4) 

T(x) - 

-> exp[+ier(x)]T(x) . 

(58.5) 
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It is not hard to check that C(x) is invariant under this transformation, 
whether or not the fields obey their equations of motion. To perform this 
check most easily, we first rewrite C as 

C = -\F' llJ F jJV + - mW , (58.6) 

where we have defined the gauge covariant derivative (or just covariant 
derivative for short) 

D lt = d lt - ieA M . (58.7) 

In the last section, we found that F IIU is invariant under eq. (58.3), and 
so the FF term in C is obviously invariant as well. It is also obvious 
that the juTT term in C is invariant under eqs. (58.4) and (58.5). This 
leaves the term. This term will also be invariant if, under the gauge 

transformation, the covariant derivative of \k transforms as 

D^{x) — > exp[— ieT{x)\D^(x) . (58.8) 

To see if this is true, we note that 

D - ie[A lt - c^T]) (expf-zeTjT) 

= exp[-zeT] (c^'k - ze^Tj'k - ie[A M - d^T]^ 

= exp[— zeT] ( d M - ieA^j T 

= exp[— zeTj.D^’k . (58.9) 

So eq. (58.8) holds, and is gauge invariant. 

We can also write the transformation rule for a little more abstractly 
as 

D fl e~ ieT D fl e +ieT , (58.10) 

where the ordinary derivative in D ^ is defined to act on anything to its 
right, including any fields that are left unwritten in eq. (58.10). Thus we 
have 


D, l F -> (e~ ieV D ^e +ieV ^j (e" ier T) 

= e~ ieV D ^ , (58.11) 

which is, of course, the same as eq. (58.9). We can also express the field 
strength in terms of the covariant derivative by noting that 


[D^,D u ]^{x) = -zeFHx)T(x) • 


(58.12) 
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We can write this more abstractly as 

F IW = l[D>\D u ] , (58.13) 

where, again, the ordinary derivative in each covariant derivative acts on 
anything to its right. From eqs. (58.10) and (58.13), we see that, under a 
gauge transformation, 

e~ ieV D e +ieT , c~ ieV D v e +ieT ' 

= e~ ieV (^[D^,D v ])e +ieV 

— e ~ieT pftu e +ieT 

= . (58.14) 

In the last line, we are able to cancel the e ±ieV factors against each other 
because no derivatives act on them. Eq. (58.14) shows us that (as we already 
knew) F^ ll/ is gauge invariant. 

It is interesting to note that the gauge transformation on the fermion 
fields, eqs. (58.4-58.5), is a generalization of the U(l) transformation 

4' -► e~ ia ^ , (58.15) 

® -> e +i “W , (58.16) 

that is a symmetry of the free Dirac lagrangian. The difference is that, 
in the gauge transformation, the phase factor is allowed to be a function 
of spacetime, rather than a constant that is the same everywhere. Thus, 
the gauge transformation is also called a local U(l) transformation, while 
eqs. (58.15-58.16) correspond to a global U(l) transformation. We say that, 
in a gauge theory, the global U(l) symmetry is promoted to a local U(l) 
symmetry, or that we have gauged the U(l) symmetry. 

In section 57, we argued that the path integral over A M should be re- 
stricted to those components of A^{k) that are orthogonal to k because 
the component parallel to did not appear in the integrand. Now we must 
make a slightly more subtle argument. We argue that the path integral over 
the parallel component is redundant, because the fermionic path integral 
over 'k and T already includes all possible values of T(a:). Therefore, as in 
section 57, we should not integrate over the parallel component. (We will 
make a more precise and careful version of this argument when we discuss 
the quantization of nonabelian gauge theories in section 71.) 
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By the standard procedure, this leads us to the following form of the 
path integral for spinor electrodynamics: 


Z(r], rj, J) oc exp 


ie / d A x 


1 


i 5J^{x) J\ 5p a {x 




*7/? (a 


X z o{v , 


(58.17) 


where 


z o (v,V,J) = ex P i J 

a 

d A xd A y rj{x)S(x — y)g{y) 


x exp 

l - j d A xd A y J^(x) A^ix - y)J u (y) 

, (58.18) 

and 



i 

II 

'S 

l 

co 

f d^P + j p(x - y ) 

1 (27t) 4 p 2 + m 1 2 — ie 6 

(58.19) 

A^(x -y) = 

f d 4 k g pv ik(x-y) 

1 (27t) 4 k 2 — ie 

(58.20) 

are the appropriate Feynman propagators for the corresponding free fields, 
with the photon propagator in Feynman gauge. We impose the normaliza- 
tion Z(0,0, 0) = 1, and write 

= exp[iW(rj,r], J)} . 

(58.21) 


Then iW ( rj , rj , J) can be expressed as a series of connected Feynman dia- 
grams with sources. 

The rules for internal and external Dirac fermions were worked out in 
the context of Yukawa theory in section 45, and they follow here with no 
change. For external photons, the LSZ analysis of section 56 implies that 
each external photon line carries a factor of the polarization vector e^(k). 

Putting everything together, we get the following set of Feynman rules 
for tree-level processes in spinor electrodynamics. 

1. For each incoming electron, draw a solid line with an arrow pointed 
towards the vertex, and label it with the electron’s four-momentum, 
Pi- 

2. For each outgoing electron , draw a solid line with an arrow pointed 
away from the vertex, and label it with the electron’s four-momentum, 
Pi- 
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3. For each incoming positron, draw a solid line with an arrow pointed 
away from the vertex, and label it with minus the positron’s four- 
momentum, —pi- 


4. For each outgoing positron, draw a solid line with an arrow pointed 

towards the vertex, and label it with minus the positron’s four-momentum, 

~p'v 


5. For each incoming photon, draw a wavy line with an arrow pointed 
towards the vertex, and label it with the photon’s four-momentum, 
ki . (Wavy lines for photons is a standard convention.) 


6. For each outgoing photon, draw a wavy line with an arrow pointed 
away from the vertex, and label it with the photon’s four-momentum, 

K ' 

7. The only allowed vertex joins two solid lines, one with an arrow point- 
ing towards it and one with an arrow pointing away from it, and one 
wavy line (whose arrow can point in either direction). Using this 
vertex, join up all the external lines, including extra internal lines as 
needed. In this way, draw all possible diagrams that are topologically 
inequivalent. 


8. Assign each internal line its own four-momentum. Think of the four- 
momenta as flowing along the arrows, and conserve four-momentum 
at each vertex. For a tree diagram, this fixes the momenta on all the 
internal lines. 


9. The value of a diagram consists of the following factors: 
for each incoming photon, e((*(kj); 
for each outgoing photon, e((,.(k'); 
for each incoming electron, u Si (pj ); 
for each outgoing electron, u s > (p(); 
for each incoming positron, u Si (pj); 
for each outgoing positron, v s i (p(); 
for each vertex, 

for each internal photon, —ig^/lk 1 — ie); 

for each internal fermion, —i(—rf + m)/(p 2 + m 2 — ie). 

10. Spinor indices are contracted by starting at one end of a fermion 
line: specifically, the end that has the arrow pointing away from the 
vertex. The factor associated with the external line is either u or v. 
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Go along the complete fermion line, following the arrows backwards, 
and write down (in order from left to right) the factors associated with 
the vertices and propagators that you encounter. The last factor is 
either a u or v. Repeat this procedure for the other fermion lines, if 
any. The vector index on each vertex is contracted with the vector 
index on either the photon propagator (if the attached photon line 
is internal) or the photon polarization vector (if the attached photon 
line is external). 

11. The overall sign of a tree diagram is determined by drawing all con- 
tributing diagrams in a standard form: all fermion lines horizontal, 
with their arrows pointing from left to right, and with the left end- 
points labeled in the same fixed order (from top to bottom); if the 
ordering of the labels on the right endpoints of the fermion lines in a 
given diagram is an even (odd) permutation of an arbitrarily chosen 
fixed ordering, then the sign of that diagram is positive (negative). 

12. The value of iT (at tree level) is given by a sum over the values of all 
the contributing diagrams. 

In the next section, we will do a sample calculation. 

Problems 

58.1) Compute P~ 1 A fl (x,t)P, T” 1 yl^(x, t)T, and C' - 1 A /i (x, t)C, assuming 
that P, T, and C are symmetries of the lagrangian. (Prerequisite: 
40.) 


58.2) Furry’s theorem. Show that any scattering amplitude with no exter- 
nal fermions, and an odd number of external photons, is zero. 
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59 Scattering in Spinor Electrodynamics 

Prerequisite: 48, 58 


In the last section, we wrote down the Feynman rules for spinor electro- 
dynamics. In this section, we will compute the scattering amplitude (and 
its spin-averaged square) at tree level for the process of electron-positron 
annihilation into a pair of photons, e + e~ — > 77. 

The contributing diagrams are shown in fig. (59.1), and the associated 
expression for the scattering amplitude is 


ZyEy v 2 \1 v 


~A + ¥1 + m 

—t + m I 2 


~A + $2 + m 

—u + m 2 


(59.1) 


where £y is shorthand for e^, (k^), V 2 is shorthand for u S2 (p2), and so on. 
The Mandelstam variables are 


s — ~ (Pi + P2) 2 — —(A + k'2) 2 , 

t = -(pi- A ) 2 = ~{P 2 - k ' 2 ) 2 , 

u = ~(pi - k'2) 2 = (jp2 ~ k [) 2 , 

and they obey s + t + u = 2 m 2 . 

Following the procedure of section 46, we write eq. (59.1) as 

T = £$£%, v 2 A tu/ ui , 

where 


6 


~A + A + m 

—t + m 2 


7m + 7m 


~A + V2 + m \ _ ‘ 

—u + m 2 ) _ 


We also have 

T* = T = £%£%U iV ^ 2 • 

Using jif ... = ... fj,, we see from eq. (59.4) that 


Thus we have 



(59.2) 

(59.3) 

(59.4) 

(59.5) 

(59.6) 


|T| 2 = £y £ 2 ' £y£2* (v 2 ^/La/Wl)(«l^opV 2 ) • (59.7) 

Next, we will average over the initial electron and positron spins, using 
the technology of section 46; the result is 

I I 7 "! 2 = \ £ v £ 2' £ v £ 2* Tr A^(-A+in)Ao P (-A- m ) • (59-8) 

S 1 , S 2 
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Figure 59.1: Diagrams for e + e — > 77 , corresponding to eq. (59.1). 


We would also like to sum over the final photon polarizations. From 
eq. (59.8), we see that we must evaluate 

E £ A( k K*( k ) • ( 59 - 9 ) 

A=± 


We did this polarization sum in Coulomb gauge in section 56, with the 
result that 

E 4‘( k K*( k ) = + in? - z»zP , (59.10) 

A=± 

where t p is a unit vector in the time direction, and z p is a unit vector in 
the k direction that can be expressed as 


k p + (■ t-k )f /i 
[. k 2 + (t-k) 2 } 1 / 2 


(59.11) 


It is tempting to drop the k p and k p terms in eq. (59.10), on the grounds 
that the photons couple to a conserved current, and so these terms should 
not contribute. (We indeed used this argument to drop the analogous 
terms in the photon propagator.) This also follows from the notion that 
the scattering amplitude should be invariant under a gauge transformation, 
as represented by a transformation of the external polarization vectors of 
the form 

e£(k) e£(k) - it(k)k p . (59.12) 

Thus, if we write a scattering amplitude T for a process that includes a 
particular outgoing photon with four-momentum k p as 


T = ^(k )M fJ , 


(59.13) 


or a particular incoming photon with four-momentum k p as 


T = £ f(k)M /i , 


(59.14) 
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then in either case we should have 

k»M lx = 0. (59.15) 

Eq. (59.15) is in fact valid; we will give a proof of it, based on the Ward 
identity for the electromagnetic current, in section 67. For now, we will 
take eq. (59.15) as given, and so drop the and k p terms in eq. (59.10). 
This leaves us with 

E - (5!u6) 

But, for an external photon, k 2 = 0. Thus the second and third terms in 
eq. (59.16) cancel, leaving us with the beautifully simple substitution rule 

E ^(k) £ ;*(k) -> <r . (59.17) 

A=± 

Using eq. (59.17), we can sum \T\ 2 over the polarizations of the outgoing 
photons, in addition to averaging over the spins of the incoming fermions; 
the result is 


(in 2 ) MEE m 2 

A', A!, «i>«2 


where 

<$«> 


(^u«) 


($tu) 

($ut) 





. (m 2 — t ) 2 


+ 


($tu) + ($ut) 
(m 2 — t)(?n 2 — u) 


+ 


($uu) 

(m 2 — u ) 2 . ’ 


(59.18) 


TV 


iTi- 

iTi- 


Tr 


lu ( ~i > x +^i +m) 7 M ( +m)y fl (-rf 1 +l'x+m)Y( ~i> 2 

7/ t (-A+^2+ m )7v(-^i+m)7 I/ (-^ 1 +^ 2 +m)7 M (-^ 2 

lA-rfi+fti+rn)y IJ ,{-rfx+rn)Y(-A+ft2+™h ,J '(-lt2 

7M(-A+^2+ TO )7v(-^i+m)7 At (-^ 1 +^ 1 +m)7 !/ (-^ 2 


-m)J , 

-m) , 

-m) , 

—m) . 
(59.19) 


Examinging {$tt} and ( < hu U ), we see that they are transformed into each 
other by k\ k' 2 , which is equivalent to t <-> u. The same is true of (&tu) 
and Thus we need only compute ($«) and (<£*„), and then take 

t u to get ($ uu ) and ($ ut ). 
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Now we can apply the gamma-matrix technology of section 47. In 
particular, we will need the d = 4 relations 

7 m 7m = ~ 4 > 

7 # ¥7m = > 

7 a ‘#7m = 4 ( af> ) > 

7 = 2jjj, , (59.20) 

in addition to the trace formulae. We also need 

P1P2 = -|(s - 2 m?) , 

& 1&2 = , 

Pik'i = P 2?2 = +\{t~ rr?) , 

p\k' 2 = P 2 k\ = + 7 *(it — rr?) . (59.21) 

which follow from eq. (59.2) plus the mass-shell conditions p\ = = —m 2 

and k' 2 = k! 2 = 0. After a lengthy and tedious calculation, we find 

(&tt) = 2 [tu — it? (3f + u) — m 4 ] , (59.22) 

<$tu> = 2m? (s - Am 2 ) , (59.23) 

which then implies 

(4> uu ) = 2 [tu — m?{3u + t) — m 4 ] , (59.24) 

= 2m 2 (s - 4 m 2 ) . (59.25) 

This completes our calculation. 

Other tree-level scattering processes in spinor electrodynamics pose no 
new calculational difficulties, and are left to the problems. 

In the high-energy limit, where the electron can be treated as massless, 
we can reduce our labor with the method of spinor helicity, which was 
introduced in section 50. We take this up in the next section. 


Problems 

59.1) Compute (|T| 2 ) for Compton scattering, e~ y — > e~y. You should 
find that your result is the same as that for e + e~ — » 77 , but with 
s t, and an extra overall minus sign. This is an example of crossing 
symmetry ; there is an overall minus sign for each fermion that is 
moved from the initial to the final state. 
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59.2) Compute ( |T| 2 ) for Bhabha scattering , e + e — ► e + e . 

59.3) Compute (|T| 2 ) for Mgller scattering , e“e~ — > e _ e~. 
find that your result is the same as that for e + e _ — > 

s <-* u. This is another example of crossing symmetry. 


You should 
e ~ , but with 
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60 Spinor Helicity for Spinor 
Electrodynamics 

Prerequisite: 50, 59 

In section 50, we introduced a special notation for u and v spinors of definite 
helicity for massless electrons and positrons. This notation greatly simpli- 
fies calculations in the high-energy limit (s, |f|, and |u| all much greater 
than rn 2 ). 

We define the twistors 

\p\ = u-(p ) = v + (p) , 

I p) = U+(p) = v-(p) , 
bl = u+(p) = v~(p) , 

{p\ =u-(p) =v + (p) . (60.1) 

We then have 

bl b] = [kp] » 

(k\ \p) = (kp) , 

[k\ \p) = 0 , 

(k\ b] = 0 , (60.2) 

where the twistor products [kp] and (kp) are antisymmetric, 

[kp] = ~\pk] , 

(kp) = —(pk) , (60.3) 

and related by complex conjugation, (pk)* = [kp]. They can be expressed 
explicitly in terms of the components of the massless four-momenta k and 
p. However, more useful are the relations 

(kp) [pk] = Tr ^(1 — 75 W 

= -2k-p 

= — (k + p) 2 (60.4) 

and 

(pq) [qr] (r s) [sp] = Tr±(l- 7 5 )#^ 

= 2[(p-q)(r-s) - ( p-r)(q-s ) + (p-s)(q-r) 

+ is^^p^rpSa] . 


(60.5) 
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Finally, for any massless four-momentum p we can write 


+ 


(60.6) 


We will quote other results from section 50 as we need them. 

To apply this formalism to spinor electrodynamics, we need to write 
photon polarization vectors in terms of twistors. The formulae we need are 


4(fc) = 


e»(k) = 


(g|7^|fe] 

v/2 (qk) 

[g|7 M |fe) 

v/2 [qk\ 


(60.7) 

(60.8) 


where q is an arbitrary massless reference momentum. 

We will verify eqs. (60.7) and (60.8) for a specific choice of k, and then 
rely on the Lorentz transformation properties of twistors to conclude that 
the result must hold in any frame (and therefore for any massless four- 
momentum k). 

We will choose k 11 = (u,ujz) = cj( 1, 0,0,1). Then, the most general 
form of £+(k) is 

4(i) = e^(fl,l,-i,0)+CF . (60.9) 


Here is an arbitrary phase factor, and C is an arbitrary complex num- 
ber; the freedom to add a multiple of k comes from the underlying gauge 
invariance. 

To verify that eq. (60.7) reproduces eq. (60.9), we need the explicit form 
of the twistors | k] and \k) when the three-momentum is in the z direction. 
Using results in section 50 we find 


fc] = v/2 cJ 

1 

0 

, | k) = v/2t o 

(°\ 

0 

1 


Vo/ 


Vo/ 


For any value of q, the twistor (q\ takes the form 

(q\ = (0, 0, a, (3) , 


(60.10) 


(60.11) 


where a and /3 are complex numbers. Plugging eqs. (60.10) and (60.11) into 
eq. (60.7), and using 


r = 




(60.12) 
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along with = (I, o') and = (I,— a), we find that we reproduce 
eq. (60.9) with = 1 and C = — /3 / (\/2mu) . There is now no need to check 
eq. (60.8), because £_{k) = — [£+(&)]*, as can be seen by using ( qk )* = 
— [qk] along with another result from section 50, (q\'y ll \k]* = (fc|7 M |(/]. 

In spinor electodynamics, the vector index on a photon polarization 
vector is always contracted with the vector index on a gamma matrix. We 
can get a convenient formula for ^ _(k) by using the Fierz identities 


We then have 


= \k]{q\ + \q)[k\ , 

(60.13) 

= \ k M + k](fc| • 

(60.14) 

t + {k-,q) = ^ (\k](q\ + I?)[fc|) , 

(60.15) 

i-ik'^) = (\k)[q\ + M(fc|) , 

(60.16) 


where we have added the reference momentum as an explicit argument on 
the left-hand sides. 

Now we have all the tools we need for doing calculations. However, 
we can simplify things even further by making maximal use of crossing 
symmetry. 

Note from eq. (60.1) that U- (which is the factor associated with an in- 
coming electron) and (an outgoing positron) are both represented by the 
twistor |p], while (an outgoing electron) and (an incoming positron) 
are both represented by [p |. Thus the square-bracket twistors correspond 
to outgoing fermions with positive helicity, and incoming fermions with 
negative helicity. Similarly, the angle-bracket twistors correspond to out- 
going fermions with negative helicity, and incoming fermions with positive 
helicity. 

Let us adopt a convention in which all particles are assigned four- 
momenta that are treated as outgoing. A particle that has an assigned four- 
momentum p then has physical four-momentum e p p, where e p = sign(p°) = 
+1 if the particle is physically outgoing, and e p = sign(p°) = — 1 if the 
particle is physically incoming. 

Since the physical three-momentum of an incoming particle is opposite 
to its assigned three-momentum, a particle with negative helicity relative 
to its physical three-momentum has positive helicity relative to its assigned 
three-momentum. From now on, we will refer to the helicity of a particle 
relative to its assigned momentum. Thus a particle that we say has “pos- 
itive helicity” actually has negative physical helicity if it is incoming, and 
positive physical helicity if it is outgoing. 
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-Pi 


> 


~P 2 


> 






Figure 60.1: Diagrams for fermion- fermion scattering, with all momenta 
treated as outgoing. 

With this convention, the square-bracket twistors \p\ and [p| represent 
positive-helicity fermions, and the angle-bracket twistors | p) and (p\ repre- 
sent negative-helicity fermions. When e p = sign(p°) = — 1, we analytically 
continue the twistors by replacing each cj 1 / 2 in eq. (60.10) with ilcjj 1 / 2 . Then 
all of our formulae for twistors and polarizations hold without change, with 
the exception of the rule for complex conjugation of a twistor product, 
which becomes 

(p k)* = e p e k [k p\ . (60.17) 

Now we are ready to calculate some amplitudes. Consider first the 
process of fermion-fermion scattering. The contributing tree-level diagrams 
are shown in fig. (60.1). 

The first thing to notice is that a diagram is zero if two external fermion 
lines that meet at a vertex have the same helicity. This is because (as shown 
in section 50) we get zero if we sandwich the product of an odd number of 
gamma matrices between two twistors of the same helicity. In particular, 
we have (p\'y tJ '\k} = 0 and [p| 7 ^|A;] = 0. Thus, we will get a nonzero result 
for the tree-level amplitude only if two of the helicities are positive, and 
two are negative. This means that, of the 2 4 = 16 possible combinations 

of helicities, only six give a nonzero tree-level amplitude: 7+_| , T”) | , 

7I| 71 71_| and 77 + _j , where the notation is T S1S2S3SA . Fur- 

thermore, the last three of these are related to the first three by complex 
conjugation, so we only have three amplitudes to compute. 

Let us begin with 71) j_. Only the first diagram of fig. (60.1) con- 

tributes, because the second has two postive-helicity lines meeting at a 
vertex. To evaluate the first diagram, we note that the two vertices con- 
tribute a factor of (ie ) 2 = — e 2 , and the internal photon line contributes a 
factor of ig pu /s 13 , where we have defined the Mandelstam variable 

Sij = -{pi+pj) 2 . (60.18) 

Following the charge arrows backwards on each fermion line, and dividing 
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by i to get T (rather than iT), we find 

T + .- + = -e 2 <3| 7 H1] [4| 7 m|2 > Ms 

= +2e 2 [14](2 3)/si 3 , (60.19) 

where (3| is short for ( j>3 1 , etc, and we have used yet another form of the 
Fierz identity to get the second line. 

The computation of T ) | is exactly analogous, except that now it is 

only the second diagram of fig. (60.1) that contributes. According to the 
Feynman rules, this diagram comes with a relative minus sign, and so we 
have 

T+_ + _ = — 2e 2 [13] (24) /s 1A . (60.20) 

Finally, we turn to Now both diagrams contribute, and we have 

<3| T ^|1] <4| Tm |2] (4| 7 ^|1](3| 7/ J2] \ 

Sl3 S14 ) 

= — 2e 2 [12] (3 4) (— + —) 

\S13 SuJ 

= +2e 2 [12](3 4) , (60.21) 

VS13S14/ 

where we used the Mandelstam relation S12 + S13 + si4 = 0 to get the last 
line. 

To get the cross section for a particular set of helicities, we must take the 
absolute squares of the amplitudes. These follow from eqs. (60.4), (60.17), 
and (60.18) : 

1(1 2)| 2 = | [1 2] | 2 = e ie2 si 2 = M • (60.22) 

We can then compute the spin-averaged cross section by summing the ab- 
solute squares of eqs. (60. 19-60. 21), multiplying by two to account for the 
processes in which all helicities are opposite (and which have amplitudes 
that are related by complex conjugation), and then dividing by four to aver- 
age over the initial helicities. Making use of S34 = S12 and its permutations, 
we find 

<|TR = 2e‘(# + 

V S 13 s 14 S 13 S U 

= 2e 4 ( 4l±4i± 40 (60.23) 

V s 13 s 14 J 

For the processes of e~e~ —> e~e~ and e + e + — > e + e + , we have S12 = s, 
s 13 = t, and S14 = u\ for e + e“ — ► e + e~, we have S13 = -s, S14 = t, and 
S12 = u. 
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Figure 60.2: Diagrams for fermion-photon scattering, with all momenta 
treated as outgoing. 

Now we turn to processes with two external fermions and two external 
photons, as shown in fig. (60.2). The first thing to notice is that a diagram is 
zero if the two external fermion lines have the same helicity. This is because 
the corresponding twistors sandwich an odd number of gamma matrices: 
one from each vertex, and one from the massless fermion propagator S(p) = 

— tHp 2 ■ Thus we need only compute 71) a 3 a 4 since 71 + a 3 a 4 is related by 

complex conjugation. 

Next we use eqs. (60.15-60.16) and (60.2-60.3) to get 


f-(k\p)\p] = 0 , 

(60.24) 

W-ihp) = 0 . 

(60.25) 

t+(k;p)\p) = 0 , 

(60.26) 

{p\ <t+(k;p) = 0 , 

(60.27) 


Thus we can get some amplitudes to vanish with appropriate choices of the 
reference momenta in the photon polarizations. 

So, let us consider 

7+-a 3 a 4 = ~e 2 (2\?\Sk 4 m){A + h)t\ 3 ( k 3m)\M /*13 

-e 2 ( 2 I^A 3 ( fc 3 m){A + 4 (fc 4 ;?4)|l] /*i4 • (60.28) 

If we take A3 = A4 = — , then we can get both terms in eq. (60.28) to vanish 
by choosing <73 = = p\, and using eq. (60.24). If we take A3 = A4 = +, 

then we can get both terms in eq. (60.28) to vanish by choosing <73 = <74 = P 2 , 
and using eq. (60.27). 

Thus, we need only compute 71) |_ and 71) )__. For 7) | , we can 

get the second term in eq. (60.28) to vanish by choosing <73 = P 2 , and using 
eq. (60.27). Then we have 

T+-+- = -e 2 (2|^_(fc 4 ;<7 4 )(Vi + ^ 3 )M fc 3;72)|l] /«13 
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2 V2 

e fe4] 


(24) [ g 4 |(z*i + &,)l 2 > [31] 


y/2 1 

(23) S13 ' 


(60.29) 


Next we note that \p\tf = 0, and so it is useful to choose either = p\ 
or (74 = k'i . There is no obvious advantage in one choice over the other, 
and they must give equivalent results, so let us take <74 = /c.3. Then, using 
eq. (60.6) for ^ 3 , we get 


2 (2 4) [31] (12) [31] 
[3 4] (2 3) s 13 


(60.30) 


Now we use [31] (12) = —[34] (42) in the numerator (see problem 60.2), 
and set S13 = (13) [31] in the denominator. Canceling common factors and 
using antisymmetry of the twistor product then yields 



(2 4) 2 
(1 3) (2 3) ' 


(60.31) 


We can now get T) |_ simply by exchanging the labels 3 and 4, 


T+—+ 


(2 3) 

(1 4) (2 4) ' 


(60.32) 


We can compute the spin-averaged cross section by summing the abso- 
lute squares of eqs. (60.31) and (60.32), multiplying by two to account for 
the processes in which all helicities are opposite (and which have ampli- 
tudes that are related by complex conjugation), and then dividing by four 
to average over the initial helicities. The result is 


(|T| 2 ) = 2e 4 


513 

514 



(60.33) 


For the processes of e~y —> e _ 7 and e + 7 — ► e + 7, we have S13 = s, s 12 = t, 
and S14 = u] for e + e^ — ► 77 and 77 — ► e + e _ we have S12 = s, s \ 3 = t, and 
S14 = u. 


Problems 


60.1) a) Show that 


p-e + (t,q) 


p-£-(k]q) 


(■ QP ) [P k ] 
\/2 (qk) 

[qp] (pk) 

v/2 [q k] 


(60.34) 


(60.35) 
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Use this result to show that 


k-s±(k;q ) = 0 , (60.36) 

which is required by gauge invariance, and also that 


q-£±(k;q) = 0 . 


(60.37) 


b) Show that 

■ ( 6a38 > 

. (60.39) 

£+ (*;«)-M*W) = ■ ( 6 »- 40 > 

Note that the right-hand sides of eqs. (60.38) and (60.39) vanish if 
q' = q, and that the right-hand side of eq. (60.40) vanishes if q = k! 
or q' = k. 

60.2) a) For a process with n external particles, and all momenta treated 
as outgoing, show that 


i 1 -]) [j k ] = 0 and ^[i j] (j k) = 0 . 
i = i i= i 


(60.41) 


Hint: make use of eq. (60.6). 

b) For n = 4, show that [3 1] (12)= — [34] (42). 

60.3) Use various identities to show that eq. (60.31) can also be written as 

[1 3] 2 


T + _ + _ = —2m 


[14] [2 4] 


(60.42) 


60.4) a) Show explicitly that you would get the same result as eq. (60.31) 
if you set <74 = p\ in eq. (60.29). 

b) Show explicitly that you would get the same result as eq. (60.31) 
if you set q± = P 2 in eq. (60.29). 
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61 Scalar Electrodynamics 

Prerequisite: 58 

In this section, we will consider how charged spin-zero particles interact 
with photons. We begin with the lagrangian for a complex scalar field with 
a quartic interaction, 

C = —d^tp^dptp — m 2 ^ip — | A (v?V) 2 • (61.1) 

This lagrangian is obviously invariant under the global U(l) symmetry 

<p(x) -> e~ ia <p(x) , 

ip\x) — > e +ia <p\x) . (61-2) 

We would like to promote this global symmetry to a local symmetry, 

ip(x) — > exp[— ieT{x)\<p{x) , (61.3) 

p\x) — > exp[+zer(x)](/?l(x) . (61-4) 

To do so, we must replace each ordinary derivative in eq. (61.1) with a 
covariant derivative 

Dfj. = d,, - ieA^ , (61.5) 

where A M transforms as 

A fl (x) — > A M (x) — <9 A T(x) , (61.6) 

which implies that D p transforms as 

— ► exp[— ieT(x)]D tt exp[+ier(x)] . (61-7) 

Our complete lagrangian for scalar electrodynamics is then 

C = -(D^D^cp - mVV - i A ((^V) 2 - \F^F^ . (61.8) 

We have added the usual gauge- invariant kinetic term for the gauge field. 
The quartic interaction term has a dimensionless coefficient, and so is nec- 
essary for renormalizability. For now, we omit the renormalizing Z factors. 

Of course, eq. (61.8) is invariant under a global U(l) transformation as 
well as a local U(l) transformation: we simply set T(x) to a constant. Then 
we can find the conserved Noether current corresponding to this symmetry, 
following the procedure of section 22. In the case of spinor electrodynamics, 
this current is same as it is for a free Dirac field, j ,Jj = In the case 

of a complex scalar field, we find 


(61.9) 
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Figure 61.1: The three vertices of scalar electrodynamics; the corresponding 
vertex factors are ie(k + k')^, —2ie 2 g flL ,, and — iX. 

With a factor of e, this current should be identified as the electromag- 
netic current. Because the covariant derivative appears in eq. (61.9), the 
electromagnetic current depends explicitly on the gauge field. We had not 
previously contemplated this possibility, but in scalar electrodynamics it 
arises naturally, and is essential for gauge invariance. 

It also poses no special problem in the quantum theory. We will make 
the same assumption that we did for spinor electrodynamics: namely, that 
the correct procedure is to omit integration over the component of A^{k) 
that is parallel to k^, on the grounds that this integration is redundant. 
This leads to the same Feynman rules for internal and external photons 
as in section 58. The Feynran rules for internal and external scalars are 
the same as those of problem 10.2. We will call the spin-zero particle with 
electric charge +e a scalar electron or selectron (recall that our convention 
is that e is negative), and the spin-zero particle with electric charge — e 
a scalar positron or spositron. Scalar lines (traditionally drawn as dashed 
in scalar electrodynamics) carry a charge arrow whose direction must be 
preserved when lines are joined by vertices. 

To determine the kinds of vertices we have, we first write out the inter- 
action terms in the lagrangian of eq. (61.8): 

Ci = ieA^[(d^)(p - y ] d^\ - e 2 A^A^ tp - \\(^p>) 2 . (61.10) 

This leads to the vertices shown in fig. (61.1). The vertex factors associated 
with the last two terms are —2 ie 2 g fiu and — iX. To get the vertex factor 
for the first term, we note that if | k) is an incoming selectron state, then 
(0|(/?(x)| k) = e Lkx and (0|^(.x)|/c) = 0; and if (k'\ is an outgoing selectron 
state, then (A/|<^t(a;)|0) = e~ lk ' x and (0|<^(.x)|fc) = 0. Therefore, in free field 
theory, 

(k'\(d^)cp\k) = -ifa-w-Q* , 

(k'l^d^k) = +ik^ k '~ k > . 


(61.11) 

(61.12) 
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This implies that the vertex factor for the first term in eq. (61.10) is given 
by i(ie)[(-ik^) - ( ik M )] = ie(k + 

Putting everything together, we get the following set of Feynman rules 
for tree-level processes in scalar electrodynamics. 

1. For each incoming selectron, draw a dashed line with an arrow pointed 
towards the vertex, and label it with the selectron’s four-momentum, 
hi . 

2. For each outgoing selectron , draw a dashed line with an arrow pointed 
away from the vertex, and label it with the selectron’s four-momentum, 

K- ‘ 

3. For each incoming spositron, draw a dashed line with an arrow pointed 
away from the vertex, and label it with minus the spositron’s four- 
momentum, —hi. 

4. For each outgoing spositron , draw a dashed line with an arrow pointed 
towards the vertex, and label it with minus the spositron’s four- 
momentum, — k[. 

5. For each incoming photon , draw a wavy line with an arrow pointed 
towards the vertex, and label it with the photon’s four-momentum, 

hi . 

6. For each outgoing photon , draw a wavy line with an arrow pointed 
away from the vertex, and label it with the photon’s four-momentum, 

K- ' 

7. There are three allowed vertices, shown in fig. (61.1). Using these 
vertices, join up all the external lines, including extra internal lines as 
needed. In this way, draw all possible diagrams that are topologically 
inequivalent. 

8. Assign each internal line its own four-momentum. Think of the four- 
momenta as flowing along the arrows, and conserve four-momentum 
at each vertex. For a tree diagram, this fixes the momenta on all the 
internal lines. 

9. The value of a diagram consists of the following factors: 
for each incoming photon, e^*(k,;); 

for each outgoing photon, e^.(kj); 

for each incoming or outgoing selectron or spositron, 1; 
for each scalar-scalar-photon vertex, ie(k + £/) M ; 
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Figure 61.2: Diagrams for e + e — > 77. 

for each scalar-scalar-photon-photon vertex, —2 ie 2 g IJ/l ,; 

for each four-scalar vertex, — ?'A; 

for each internal photon, —ig^ v /{k 2 — ie); 

for each internal scalar, — i/{k 2 + m 2 — ie). 


10. The vector index on each vertex is contracted with the vector index 
on either the photon propagator (if the attached photon line is inter- 
nal) or the photon polarization vector (if the attached photon line is 
external) . 

11. The value of iT (at tree level) is given by a sum over the values of all 
the contributing diagrams. 


Let us compute the scattering amplitude for a particular process, e + e~ — 
77, where e~ denotes a selectron. We have the diagrams of fig. (61.2). The 
amplitude is 

iT = ( ig )2 I C 2k 1 -k' 1 )^ l (k 1 -k' 1 -k 2 )u 4 + (1 / ^ 2 /) 

i m 2 — t 

- 2ie 2 g^EyEy , (61.13) 


where t = —{k\ — k[) 2 and u = —(k\ — k' 2 ) 2 ■ This expression can be 
simplified by noting that k\ — k[ — ^2 = k' 2 — 2k 2 , and that k\-e\ = 0. Then 
we have 


T = -e 2 


'4(fc r £i/)(A;2-£2') , / l{k 1 -e 2 '){k 2 -£v) 


m 1 — t 


+ 


m- — u 


+ 2 {£y -£ 2 i) 


(61.14) 


To get the polarization-summed cross section, we take the absolute square 
of eq. (61.14), and use the substitution rule 


H 7(k)7* 00 -* 9'“’ ■ (61.15) 

A=± 


This is a straightforward calculation, which we leave to the problems. 
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Problems 

61.1) Compute (|T| 2 ) for e + e _ — ► 77 , and express your answer in terms of 
the Mandelstam variables. 

61.2) Compute (|T| 2 ) for the process e~y — > e~y. You should find that 
your result is the same as that for e + e _ — > 77 , but with s t, an 
example of crossing symmetry. 
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62 Loop Corrections in Spinor 
Electrodynamics 

Prerequisite: 51, 59 

In this section we will compute the one-loop corrections in spinor electro- 
dynamics. 

First let us note that the general discussion of sections 18 and 29 leads 
us to expect that we will need to add to the free lagrangian 

Cq = {$0* - mW - \F IW F IIV (62.1) 

all possible terms whose coefficients have positive or zero mass dimension, 
and that respect the symmetries of the original lagrangian. These include 
Lorentz symmetry, the U(l) gauge symmetry, and the discrete symmetries 
of parity, time reversal, and charge conjugation. 

The mass dimensions of the fields (in four spacetime dimensions) are 
[A 11 ] = 1 and [4i] = §. Gauge invariance requires that A p appear only in 
the form of a covariant derivative D Y (Recall that the field strength F 
can be expressed as the commutator of two covariant derivatives.) Thus, 
the only possible term we could add to £o that does not involve the 'll 
field, and that has mass dimension four or less, is e^y^F^ F pa . This term, 
however, is odd under parity and time reversal. Similarly, there are no terms 
meeting all the requirements that involve T : the only candidates contain 
either 75 (e.g., ^75^) and are forbidden by parity, or C (e.g, T T C'I') and 
are forbidden by the U(l) symmetry. 

Therefore, the theory we will consider is specified by £ = £o+£i , where 
£0 is given by eq. (62.1), and 

£1 = Z^JfAL + £ ct , (62.2) 

£ct = *(Z 2 -l)tf0tf - (Z m -l)mM - \{Z^l)F pv F^ . (62.3) 

We will use an on-shell renormalization scheme. 

We can write the exact photon propagator (in momentum space) as a 
geometric series of the form 

\ w {k) = A ^(k) + A^AOn^AOA au (k) + . . . , (62.4) 

where iH llu (k) is given by a sum of one-particle irreducible (1PI for short; 
see section 14) diagrams with two external photon lines (and the external 
propagators removed), and A ^ u (k) is the free photon propagator, 

A MI /(fc) = — (g^ ~ ) • (62.5) 
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Here we have used the freedom to add k y or k v terms to put the propa- 
gator into generalized Feynman gauge or R £ gauge. (The name gauge 
has historically been used only in the context of spontaneous symmetry 
breaking — see section 85 — but we will use it here as well. R stands for 
renormalizable and £ stands for £.) Setting £ = 1 gives Feynman gauge, 
and setting £ = 0 gives Lorenz gauge (also known as Landau gauge). 

Observable squared amplitudes should not depend on the value of £. 
This suggests that W iu (k) should be transverse , 

k^ w (k) = k u U^(k) = 0 , (62.6) 

so that the £ dependent term in A^ ,(k) vanishes when an internal photon 
line is attached to TL lxl '(k). Eq. (62.6) is in fact valid; we will give a proof of 
it, based on the Ward identity for the electromagnetic current, in problem 
68.1. For now, we will take eq. (62.6) as given. This implies that we can 
write 


U^(k) = n {k 2 )(k 2 g^ - k^k u ^j (62.7) 

= k 2 U(k 2 )P^{k) , (62.8) 

where n(fc 2 ) is a scalar function, and P^ v {k) = g^ w — k^k u /k 2 is the pro- 
jection matrix introduced in section 57. 

Note that we can also write 

A[j, u (k) = -gy—r^ (Pin>(k) + £ ) • (62.9) 

Then, using eqs. (62.8) and (62.9) in eq. (62.4), and summing the geometric 
series, we find 


A ^(k) 


Pnu{k) t k^K/k 2 

k 2 [l - n(k 2 )} ~ ie + ^ k 2 — ie ' 


(62.10) 


The £ dependent term should be physically irrelevant (and can be set to 
zero by the gauge choice £ = 0, corresponding to Lorenz gauge). The 
remaining term has a pole at k 2 = 0 with residue P fJjl/ (k)/[l — n(0)]. In our 
on-shell renormalization scheme, we should have 


n( 0 ) = 0 . 


(62.11) 


This corresponds to the field normalization that is needed for validity of 
the LSZ formula. 
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Figure 62.1: The one- loop and counterterm corrections to the photon prop- 
agator in spinor electrodynamics. 

Let us now turn to the calculation of Ti^ v {k). The one- loop and coun- 
terterm contributions are shown in fig. (62.1). We have 

/ \ 2 C r i 

= (-1 KiZ ie )\l) J -— iTl [s(t+t)'fSU)'f 

- i(Z 3 -l){k 2 g ^ - k^k u ) + 0(e 4 ) , (62.12) 

where the factor of minus one is for the closed fermion loop, and S{jf) = 
(— q/t+m) / {p 2 +rn? —ie ) is the free fermion propagator in momentum space. 
Anticipating that Z\ = 1 + 0(e 2 ), we set Z\ = 1 in the first term. 


We can write 

r ~ - i C 4N» U 

Tr[s(l+t)'fS(t)'f] = dx , ( 62 . 13 ) 

where we have combined denominators in the usual way: q = t + xk and 

D = x(l—x)k 2 + m 2 — ie . ( 62 . 14 ) 

The numerator is 

4 ( 62 . 15 ) 

Completing the trace, we get 

= (£+k)»r + P{t+k) v - \l{i+k) + m 2 \g ^ . ( 62 . 16 ) 

Setting l = q — xk and and dropping terms linear in q (because they inte- 
grate to zero), we find 

_► 2 q^q v - 2®(1 ~x)U L k v - [q 2 - x(l-x)k 2 + m 2 ]g ^ . ( 62 . 17 ) 

The integrals diverge, and so we analytically continue to d = 4 — e dimen- 


sions, and replace e with eg £ / 2 (so that e remains dimensionless for any 
d). 
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Next we recall a result from problem 14.3, 


d d q (fq\f(q 2 ) = ± (/" j d d qq 2 f(q 2 


(62.18) 


This allows the replacement 


N ^ — ■> —2x(l—x)k fJ 'k u + — l^jq 2 + x(l— x)k 2 — m 2 g^ v . (62.19) 

Using the results of section 14, along with a little manipulation of gamma 
functions, we can show that 


(M/ 


(2it) d ( q 2 + D ) 2 


d d q 1 
( 2 ^)^ (q 2 + D ) 2 


(62.20) 


Thus we can make the replacement (| — 1 )q 2 — >• D in eq. (62.19), and we 
find 

-»• 2x(l—x)(k 2 g fll/ - FU) . (62.21) 

This guarantees that the one-loop contribution to IT"' (A;) is transverse (as 
we expected) in any number of spacetime dimensions. 

Now we evaluate the integral over q , using 


(27r) d ((7 2 + H) 2 167 t 2 


r(f)(4,rAyo) £ 




(62.22) 


where /r 2 = 47 re 7 /I 2 , and we have dropped terms of order e in the last line. 
Combining eqs. (62.7), (62.12), (62.13), (62.21), and (62.22), we get 

n(A 2 ) = -A- [ dxx( 1—x) - — \\n(D / p 2 ) — (Z 3 — 1) + 0(e 4 ) . (62.23) 
7T- 4o L £ J 

Imposing 11(0) = 0 fixes 

e 2 n 1 

Z 3 = l- — j ln(m//x) + 0(e 4 ) (62.24) 

fa7r- Le 


e 2 r 1 

II(&: 2 ) = — / dxx(l— x) ln(H/m 2 ) + 0(e 4 ) . (62.25) 

27T Z Jo 

Next we turn to the fermion propagator. The exact propagator can be 
written in Lehmann-Kallen form as 


S(x0 = 


1 + m — te 


— — I - ds 


pv{s) 


'+ Js - ie 


(62.26) 
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We see that the first term has a pole at ]f = —m with residue one. This 
residue corresponds to the field normalization that is needed for the validity 
of the LSZ formula. 

There is a problem, however: in quantum electrodynamics, the thresh- 
old mass mth is m, corresponding to the contribution of a fermion and a 
zero-energy photon. Thus the second term has a branch point at ]) = — m. 
The pole in the first term is therefore not isolated, and its residue is ill 
defined. 

This is a reflection of an underlying infrared divergence, associated with 
the massless photon. To deal with it, we must impose an infrared cutoff 
that moves the branch point away from the pole. The most direct method 
is to change the denominator of the photon propagator from k 2 to k 2 + m 7 , 
where m 7 is a fictitious photon mass. Ultimately, as in section 26, we must 
deal with this issue by computing cross-sections that take into account 
detector inefficiencies. In quantum electrodynamics, we must specify the 
lowest photon energy cj m i n that can be detected. Only after computing cross 
sections with extra undectable photons, and then summing over them, is 
it safe to take the limit m 7 — ► 0. It turns out that it is not also necessary 
to abandon the on-shell shell renormalization scheme (as we were forced to 
do in massless y? 3 theory in section 27), as long as the electron is massive. 

An alternative is to use dimensional regularization for the infrared di- 
vergences as well as the ultraviolet ones. As discussed in section 25, there 
are no soft-particle infrared divergences for d > 4 (and no colinear diver- 
gences at all in quantum electrodynamics with massive charged particles). 
In practice, infrared-divergent integrals are finite away from even-integer 
dimensions, just like ultraviolet-divergent integrals. Thus we simply keep 
d = 4 — e all the way through to the very end, taking the e — ► 0 limit 
only after summing over cross sections with extra undetectable photons, all 
computed in 4 — e dimensions. This method is calculationally the simplest, 
but requires careful bookkeeping to segregate the infrared and ultraviolet 
singularities. For that reason, we will not pursue it further. 

We can write the exact fermion propagator in the form 

S(^) _1 = lj+ m - ie - £(#0 , (62.27) 

where iE(^/) is given by the sum of 1PI diagrams with two external fermion 
lines (and the external propagators removed). The fact that S(^/) has a pole 
at = —m with residue one implies that £(— m) = 0 and T/{—m) = 0; this 
fixes the coefficients Z 2 and Z m . As we will see, we must have an infrared 
cutoff in place in order to have a finite value for S r (— m). 

Let us now turn to the calculation of E(jf). The one- loop and counter- 
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Figure 62.2: The one- loop and counterterm corrections to the fermion prop- 
agator in spinor electrodynamics. 

term contributions are shown in fig. (62.2). We have 


im = (iZie) 2 (l 




- i(Z 2 -l)rf - i(Z m -l)m + 0(e 4 ) . 
It is simplest to work in Feynman gauge, where we take 


(62.28) 


A^(£) = 


£ 2 + m 2 — ie 


(62.29) 


here we have included the fictitious photon mass m 7 as an infrared cutoff. 
We now apply the usual bag of tricks to get 


= e 2 p £ J dx J 


1 f d d q N 

J (2^p ( q 2 + D) 2 


- i(Z 2 - i(Z m -l)m + 0(e 4 ) 


where q = £ + xp and 


D = x(l— x)p 2 + xm 2 + (1 —x)m 2 , 
N = + 

= - (d— 2) {$ + () — dm 
= -(d- 2)[<k + (l-x)rf\ - dm , 


(62.30) 


(62.31) 


(62.32) 


where we have used (from section 47) 7 M 7 M = — d and 7^7 M = (d— 2)]f. 
The term linear in q integrates to zero, and then, using eq. (62.22), we get 


J dx{(2— e)(l— x)tf+ (4— e)m^j - — | In (D/p 2 ) 


- (Z 2 -l)rf - (Z m -l)m + 0(e 4 ) . 


(62.33) 
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Figure 62.3: The one-loop correction to the photon-fermion-fermion vertex 
in spinor electrodynamics. 


We see that finiteness of E(jf) requires 



e 2 / 1 \ 

Z 2 = 1 - ^ 2(7 + finite J + 0(e 4 ) , 

(62.34) 


Z m = 1 - 7^3 (7 + fi nite j + °( e4 ) ■ 

(62.35) 

We can impose 

S(— m) = 0 by writing 


= w 

J dx [pi—x)]f + 2iPj\n(D / D q) + «2(V + m ) 

+ 0(e A ) , 



(62.36) 

where Do is D evaluated at p 2 = —m 2 , 



Dq = x 2 m 2 + (1— x)m 2 , 

(62.37) 


and K 2 is a constant to be determined. We fix K 2 by imposing £'(— m) = 0. 
In differentiating with respect to rf, we take the p 2 in D , eq. (62.31), to be 
—Ip", we find 


K 2 = — 2 / dxx(l— x 2 )m 2 /Do 
Jo 

= —2 In (m/m 7 ) + 1 , (62.38) 

where we have dropped terms that go to zero with the infrared cutoff m 7 . 

Next we turn to the loop correction to the vertex. We define the vertex 
function ,p) as the sum of one-particle irreducible diagrams with one 

incoming fermion with momentum p, one outgoing fermion with momentum 
p' , and one incoming photon with momentum k = p'—p. The original vertex 
iZ is the first term in this sum, and the diagram of fig. (62.3) is the 
second. Thus we have 


i\^(p,p) = iZiep + iVu oop {p',P ) + 0(e 5 ) , 


(62.39) 
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where 

/ \ 3 r i 

Wf loop (p',p) = («) 3 (i) J [l e S(/+f);“SV+t ) . 

(62.40) 

We again use eq. (62.29) for the photon propagator, and combine denomi- 
nators in the usual way. We then get 

Wf loop (p',p) = e 3 / dF 3 j , (62.41) 

where the integral over Feynman parameters is 

J dFs = 2 J dxidx2dx3d(xi+X2+X3—l), (62.42) 

and 

q = i + x\ p + x 2 p , (62.43) 

D = x\(l—xi )p 2 + x 2 (l— x 2 )p 2 — 2 x\X 2 p-p 

+ (xi+X 2 )m 2 + X 3 , (62.44) 

- f + m) 7 m (- j / - f + m)Y 

= lv[~i + xirf - (l-x 2 )/ + m]7^[-^ - (l-xi)^ + x 2 / + m]7 i/ 

= yujl^j"/ 1 ' + + (linear in (/) , (62.45) 

where 

fV p = 7i/[aq^ - (l-x 2 )/ + m]7 M [-(l-xi)^ + x 2 / + mjq 1 ' . (62.46) 

The terms linear in q in eq. (62.45) integrate to zero, and only the first term 
is divergent. After continuing to d dimensions, we can use eq. (62.18) to 
make the replacement 

Ivll^hY ~^\ q2 • (62.47) 

Then we use 7 P 7 P 7 P = (d— 2)q p twice to get 

lAY l iY ->• ^ P Q 2 7 m • (62.48) 

Performing the usual manipulations, we find 

Vf loop (j >',p) = ^ (i - 1 - 1 JdF 3 In (D/p 2 )) 7 " + kjdF 3 tL . 

(62.49) 



62: Loop Corrections in Spinor Electrodynamics 


377 


From eq. (62.39), we see that finiteness of V /< (p' , p) requires 

g 2 / 1 \ 

Zi = 1 - ( - + finite j + 0(e 4 ) . (62.50) 

To completely fix V M (p',p), we need a suitable condition to impose on it. 

We take this up in the next section. 

Problems 

62.1) Show that adding a gauge fixing term — |£ _1 (9C4 M ) 2 to C results in 
eq. (62.9) as the photon propagator. Explain why £ = 0 corresponds 
to Lorenz gauge, d ,J A^ = 0. 

62.2) Find the coefficients of e 2 /e in Zi, 2 , 3 ,m in Re, gauge. In particular, 
show that Z\ = Z -2 = 1 + 0(e 4 ) in Lorenz gauge. 

62.3) Consider the six one-loop diagrams with four external photons (and 
no external fermions). Show that, even though each diagram is log- 
arithmically divergent, their sum is finite. Use gauge invariance to 
explain why this must be the case. 
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63 The Vertex Function in Spinor 
Electrodynamics 

Prerequisite: 62 


In the last section, we computed the one-loop contribution to the vertex 
function V /i (p ' , p) in spinor electrodynamics, where p is the four-momentum 
of an incoming electron (or outgoing positron), and p' is the four-momentum 
of an outgoing electron (or incoming positron). We left open the issue of 
the renormalization condition we wish to impose on V^ l (p\p). 

For the theories we have studied previously, we have usually made the 
mathematically convenient (but physically obscure) choice to define the 
coupling constant as the value of the vertex function when all external four- 
momenta are set to zero. However, in the case of spinor electrodynamics, 
the masslessness of the photon gives us the opportunity to do something 
more physically meaningful: we can define the coupling constant as the 
value of the vertex function when all three particles are on shell: p 2 = p' 2 = 
— m 2 , and q 2 = 0, where q = p' — p is the photon four-momentum. Because 
the photon is massless, these three on-shell conditions are compatible with 
momentum conservation. 

To be more precise, let us sandwich V A ‘(F,p) between the spinor factors 
that are appropriate for an incoming electron with momentum p and an 
outgoing electron with momentum p' , impose the on-shell conditions, and 
define the electron charge e via 


Us'(p')^{p',p)us(p) 


p2_p/2__ m 2 

(p'-p ) 2 = o 


erv(p / )7 / X(p) 


p2_p/2__ m 2 

(p'-p) 2 = o 


(63.1) 


This definition is in accord with the usual one provided by Coulomb’s 
law. To see why, consider the process of electron-electron scattering. Ac- 
cording to the general discussion in section 19, we compute the exact ampli- 
tude for this process by using tree diagrams with exact internal propagators 
and vertices, as shown in fig. (63.1). In the last section, we renormalized 
the photon propagator so that it approaches its tree-level value A ^ u {q) 
when q 2 — ► 0. And we have just chosen to renormalize the electron-photon 
vertex function by requiring it to approach its tree-level value when 
q 2 — * 0, and when sandwiched between external spinors for on-shell incom- 
ing and outgoing electrons. Therefore, as q 2 — > 0, the first two diagrams 
in fig. (63.1) approach the tree-level scattering amplitude, with the electron 
charge equal to e. Furthermore, the third diagram does not have a pole at 
q 2 = 0, and so can be neglected in this limit. Physically, q 2 —> 0 means 
that the electron’s momentum changes very little during the scattering. 
Measuring a slight deflection in the trajectory of one charged particle (due 
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Figure 63.1: Diagrams for the exact electron-electron scattering amplitude. 
The vertices and photon propagator are exact; external lines stand for the 
usual u and u spinor factors, times the unit residue of the pole at p 2 = —m 2 . 

to the presence of another) is how we measure the coefficient in Coulomb’s 
law. Thus, eq. (63.1) corresponds to this traditional definition of the charge 
of the electron. 

We can simplify eq. (63.1) by noting that the on-shell conditions actually 
enforce p' = p. So we can rewrite eq. (63.1) as 

u s (p)V^{p,p)u s (jp) = eu s (p)^u s {p) 

= 2ej/ , (63.2) 

where p 2 = — m 2 is implicit. We have taken s' = s, because otherwise the 
right-hand side vanishes (and hence does not specify a value for e). 

Now we can use eq. (63.2) to completely determine V M (p',p). Using the 
freedom to choose the finite part of Z\ , we write 

V»(j/,p) = erf-^ I dF 3 (HD/D 0 )+2K 1 )r-^ +0(e 5 ), (63.3) 
where 

D = Xl (1 — X\)p 2 +X2(1— X2 )p 2 — 2 x\X 2 p-p 

+ (xi+x 2 )m 2 + x 3 m 2 , (63.4) 

Dq is D evaluated at p' = p and p 2 = — m 2 , 

Do = (xi+x 2 ) 2 m 2 + x 3 m 2 

= (1 -x 3 ) 2 m 2 + x 3 m 2 , (63.5) 

and 


AT = y v \ X \ij) — (l-x 2 )/ + m]7 M [-(l-xi)^ + x 2 /m]7 iy ; 


(63.6) 
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N M was called N M in section 62, but we have dropped the tilde for notational 
convenience. 

We fix the constant K\ in eq. (63.3) by imposing eq. (63.2). This yields 


4k, i = / dF 3 


u s (p)NqU s (p) 
2 Do 


where is with p' = p and p 2 = p ' 2 = —m 2 . 

So now we must evaluate uN^u. To do so, we first write 

N M (i^i +m)7 M (^ 2 +?n)7 I/ , 


(63.7) 


(63.8) 


where 


ai = xip - (1 -x 2 )p , 

a 2 = x 2 p' - (1-xi )p • (63.9) 

Now we use the gamma matrix contraction identities to get 

N ** = 2ji 2 y IJ '{i l + 4?n(ai+a 2 ) M + 2m 2 7 M . (63.10) 


Here we have set d = 4, because we have already removed the divergence 
and taken the limit e — > 0. Setting p' = p. and using rfu = —mu and 
wjf = —mu, along with uy^u = 2 p^ and uu = 2 m, and recalling that 
X 1 +X 2 +X 3 = 1, we find 

uNqU = 4(1— 4^3 +x\)m 2 p ,x . (63.11) 

Using eqs. (63.5), (63.7), and (63.11), we get 

= 1 7 l-4a 3 +x§ 

Kl 2 J 3 (l-x 3 ) 2 + x 3 m 2 /m? 

[ l M N 1— 4x 3 +x 3 

= / dx 3 1-X 3 )- rj- 77 — 2 

Jo (1— x 3 )- + x 3 m z /m z 

= -2In(m/m 7 ) + § (63.12) 

in the limit of m 7 — > 0. We see that an infrared regulator is necessary for 
the vertex function as well as the fermion propagator. 

Now that we have V /J (// ,p), we can extract some physics from it. Con- 
sider again the process of electron-electron scattering, shown in fig. (63.1). 
In order to compute the contributions of these diagrams, we must evalu- 
ate u s '(p')\ IJ '(p' ,p)u s (p) with p 2 = —p ' 2 = —m 2 , but with q 2 = ( p • — p ) 2 
arbitrary. 
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To evaluate u'N^u, we start with eq. (63.10), and use the anticommu- 
tation relations of the gamma matrices to move all the ]/>'s in AT to the far 
right (where we can use q)u = —mu) and all the jf h s to the far left (where 
we can use u’]/' = —mu'). This results in 

AT — > [4(1— x\— X 2 +x\X 2 )p-p + 2(2xi— x 2 +2x 2 — x^fm 2 ]^ 

+ 4m(x 2 — x 2 +xix 2 )p M + 4?n(x2— xi+xix 2 )pT ■ (63.13) 

Next, replace p-p' with q 2 —m 2 , group the and pT terms into p' + p 

and p' — p combinations, and make use of xi+x 2 +x 3 = 1 to simplify some 
coefficients. The result is 


AT -»• 2[(1— 2x 3 — x§)m 2 - (x 3 +xix 2 )g 2 ]7 M 

- 2m(x 3 -x|)(p' +pY 

- 2m[(xi+x 2 ) - (x 2 +x| )\(p' - pY . (63.14) 

In the denominator, set p 2 = p' 2 = —m 2 and p-p' = —\q 2 —m 2 to get 

D — > x\X 2 q 2 + (1— x 3 ) 2 m 2 + x 3 ?n 2 . (63.15) 


Note that the right-hand side of eq. (63.15) is symmetric under x 3 x 2 . 
Thus the last line of eq. (63.14) will vanish when we integrate u'N^u/D 
over the Feynman parameters. Finally, we use the Gordon identity from 
section 38, 

u'(p' + pYu = u\lm r )' L + 2iS^q u ]u , (63.16) 

where = f[7 / \7 1 '], to get 

AT — > 2[(1— 4x 3 +x|)m 2 — (x 3 +xix 2 )^ 2 ] 7 M 
— 4 im{x^—x\) S^ v q u . 


So now we have 

,p)u s (p) = eu Fi(g 2 ) 7 ^ - j n F 2 (q 2 )S' lv q v 
where we have defined the form factors 


(63.17) 

(63.18) 


Fi(q 2 ) = 1 - 


167T 2 


dF 3 


In 1 + 


x 3 x 2 q 2 /m 2 
(l-x 3 ) 2 


+ 


1— 4x 3 +X3 


(1— x 3 ) 2 + x 3 m 2 /m 2 


+ 


(x 3 +xix 2 )g 2 /?n 2 - (1-4x 3 +x 2 ) 
x\ X 2 <f 2 /m 2 + (1— x 3 ) 2 + x 3 m 2 /m 2 


F 2 (q 2 ) = 


8ir 2 


dF 3 


x 3 -xl 


xi x 2 q 2 /m 2 + (1— x 3 ) 2 


+ 0(e 4 ) 


+ 0(e 4 ), (63.19) 


(63.20) 
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We have set m 7 = 0 in eq. (63.20), and in the logarithm term in eq. (63.19), 
because these terms do not suffer from infrared divergences. 

We can simplify ^(g 2 ) by using the delta function in dF 3 to do the 
integral over X 2 (which replaces X 2 with 1— X3— x±), making the change of 
variable x\ = (1— X3 )y, and performing the integral over X3 from zero to 
one; the result is 


F2(q 2 ) 


e 2 I " 1 dy 

87 r 2 Jo 1 — y(l—y)q 2 /m 2 


+ 0 (e 4 ). 


(63.21) 


This last integral can also be done in closed form, but we will be mostly 
interested in its value at q 2 = 0, corresponding to an on-shell photon: 


F 2 (0) = ^ + O(a 2 ), (63.22) 

where a = e 2 /47r = 1/137.036 is the fine-structure constant. We will explore 
the physical consequences of eq. (63.22) in the next section. 


Problems 


63.1) The most general possible form of u'V^(p' ,p)u is a linear combination 
of 7^, //', and pT sandwiched between u ’ and u , with coefficients that 
depend on q 2 . (The only other possibility is to include terms with 75, 
but 75 does not appear in the tree-level propagators or vertex, and so 
it cannot be generated in any Feynman diagram; this is a consequence 
of parity conservation.) Thus we can write 

Us’iv'YV^ip' ,p)u s {v) = eu’lAtq 2 )^ + B(q 2 ){p +pY 

+ C(q 2 )(p' — p)^ l ]u . (63.23) 


a) Use gauge invariance to show that q^u'V^(p' ,p)u = 0, and deter- 
mine the consequences for A, B, and C . 

b) Express F\ and F 2 in terms of A, B, and C. 
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64 The Magnetic Moment of the Electron 

Prerequisite: 63 

In the last section, we computed the one-loop contribution to the vertex 
function V /J ' (j/ , p) in spinor electrodynamics, where p is the four-momentum 
of an incoming electron, and p' is the four-momentum of an outgoing elec- 
tron. We found 

Us'ip^V^ip' ,p)u s (p) = eu' Fi{q 2 )q M - j n F 2 {q 2 )S^ u q v u , (64.1) 

where q = p' — p is the four-momentum of the photon (treated as incoming), 
and with complicated expressions for the form factors Fi(q 2 ) and F 2 (q 2 ). 
For our purposes in this section, all we will need to know is that 

F\ ((]) = 1 exactly, 

^ 2 (0) = ^ + O(a 2 ). (64.2) 

Eq. (64.1) follows from a quantum action of the form 

r = J d 4 x [eTUO)^ + JL-F 2 (0)F^S^V + ...], (64.3) 

where the ellipses stand for terms with more derivatives. The displayed 
terms yield the vertex factor of eq. (64.1) with q 2 = 0. To see this, recall 
that an incoming photon translates into a factor of A ^ ~ £*^e iqx , and there- 
fore of F^ u ~ i(q IJj E*—q l ,£*n)e iqx - the two terms in F^ v cancel the extra factor 
of one half in the second term in eq. (64.3). 

Now we will see what eq. (64.3) predicts for the magnetic moment of 
the electron. We define the magnetic moment by the following procedure. 
We take the photon field A ^ to be a classical field that corresponds to a 
constant magnetic field in the z direction: A 0 = 0 and A = (0, Bx , 0). This 
yields Fi- 2 = —F 2 i = B, with all other components of F^ v vanishing. Then 
we define a normalized state of an electron at rest, with spin up along the 
z axis: 

|e) = J dpf{p)b\_( p)|0) , (64.4) 

where the wave packet is rotationally invariant (so that there is no orbital 
angular momentum) and sharply peaked at p = 0, something like 

/( P) ~ exp(— a 2 p 2 /2) (64.5) 

with a <C 1/m. We normalize the wave packet by / dp\f(p)\ 2 = 1; then we 
have (e|e) = 1. 
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Now we define the interaction hamiltonian as what we get from the 
two displayed terms in eq. (64.3), using our specified field A and with the 
form-factor values of eq. (64.2): 

H\ = -eB J d 3 x T [x 7 2 + t^S 12 ^ 4/ . (64.6) 

Then the electron’s magnetic moment g is specified by 

gB = -<e|ifi|e) . (64.7) 

In quantum mechanics in general, if we identify H± as the piece of the 
hamiltonian that is linear in the external magnetic field, then eq. (64.7) 
defines the magnetic moment of a normalized quantum state with definite 
angular momentum in the B direction. 

Now we turn to the computation. We need to evaluate {e\^ aix)^ p{x)\e) . 
Using the usual plane-wave expansions, we have 

(0|6+(p / )4' Q (a:)4' / 3(x)6 1 f (p)| 0 ) = u+{p) a u + (p)g e l{ - p ~ p ' )x . (64.8) 

Thus we get 

(e|i7i|e) = — eB J dp dp' d 3 x edv~v') x 

x r(p / )^+(p , )[®7 2 + 2^7 5l2 ] u +(p)/(p) • ( 64 - 9 ) 

We can write the factor of x as —id Pl acting on e i ^ p ~ p '^ x , and integrate 
by parts to put this derivative onto u+(p)/(p); the wave packets kill any 
surface terms. Then we can complete the integral over d 3 x to get a factor 
of (27r) 3 d 3 (p / — p), and do the integral over dp' . The result is 

(e|i7i|e) = -eB j ^ /*(p)«+(p)[*7 2 ^ P1 + 2 ^ 5 ' 12 ] w +(p)/(p) • 

(64.10) 

Suppose the d pi acts on /( p). Since /( p) is rotationally invariant, the 
result is odd in p \ . We then use 7t_|_(p)7*u_)_(p) = 2 p l to conclude that this 
term is odd in both p\ and p 2 , and hence integrates to zero. 

The remaining contribution from the first term has the d pi acting on 
u_|_(p). Recall from section 38 that 

u s { p) = exp(ir/p-K)u s (0) , (64.11) 

where Ri = SA ] = | 7 :, 7 0 is the boost matrix, p is a unit vector in the p 
direction, and rj = sinh _1 (|p|/m) is the rapidity. Since the wave packet is 
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sharply peaked at p = 0, we can expand eq. (64.11) to linear order in p, 
take the derivative with respect to pi, and then set p = 0; the result is 

3 P1 w+(p)| p=o = m Klu +(° ) 

= 

= -i 7 V( 0), (64.12) 

where we used 7°rt s (0) = u s ( 0) to get the last line. Then we have 
u+(p)i7 2 9 pi n+(p) = n+(0)2^7 2 7 1 r t+ (0) 

= |« + (0)S 12 U+ (0) (64.13) 

Plugging this into eq. (64.10) yields 

(e|Bi|e) = -eB J h)|/(p)| 2 (l + jg:) i>i + (0)S 12 u + (0) 

= -^( 1 + K ) a + (0)Sl2 “+ (0) ' ( 6414 ) 

Next we use 5 12 u±(0) = ±^ii±(0) and «±(0)u±(0) = 2m to get 

<e|^|e) = - + . (64.15) 

Comparing with eq. (64.7), we see that the magnetic moment of the electron 
is 

(64.16) 

where e/2 m is the Bohr magneton , the extra factor of one-half is for the 
electron’s spin (a classical spinning ball of charge would have a magnetic 
moment equal to the Bohr magneton times its angular momemtum), and 
g is the Lande g factor, given by 

g = 2(l + ^ + 0{a 2 )) . (64.17) 

Since g can be measured to high precision, calculations of p, provide a 
stringent test of spinor electrodynamics. Corrections up through the a 4 
term have been computed; the result is currently in good agreement with 
experiment. 

Problems 

64.1) Let the wave packet be /( p) ~ exp(— a 2 p 2 /2)l/ m (p), where Y^ m ( p) 
is a spherical harmonic. Find the contribution of the orbital angular 
momentum to the magnetic moment. 
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65 Loop Corrections in Scalar 
Electrodynamics 

Prerequisite: 61, 62 


In this section we will compute the one-loop corrections in scalar electro- 
dynamics. We will concentrate on the divergent parts of the diagrams, 
enabling us to compute the renormalizing Z factors in the MS scheme, and 
hence the beta functions. This gives us the most important qualitative in- 
formation about the theory: whether it becomes strongly coupled at high 
or low energies. 

Our lagrangian for scalar electrodynamics in section 61 already includes 
all possible terms whose coefficients have positive or zero mass dimension, 
and that respect Lorentz symmetry, the U(l) gauge symmetry, parity, time 
reversal, and charge conjugation. Therefore, the theory we will consider is 


£ = £o + £i, (65.1) 

£o = - mW - jF^Ffjv , (65.2) 

jC\ = iZie[(p^ d^(p — — Z^e 2 ^ p>A^ 1 A^ 

~ + C ct , (65.3) 

£ct = -(Z 2 -l)0VfyP - ( Z m - l)mW - \( Z 3 - 1 ) F ^ F ^ . (65.4) 


We will use the MS renormalization scheme to fix the values of the Z's. 

We begin with the photon self-energy, LPL/c). The one- loop and coun- 
terterm contributions are shown in fig. (65.1). We have 


in^ife) 


( iZ\e 




dH 


(2i + ky(2i + ky 


+ (-2 iZ 4 )e 2 g^ j 


(2-7r) 4 ((£+k) 2 + m 2 ){£ 2 + m 2 ) 

dH 1 


(27t) 4 £ 2 + m 2 
-i{Z 3 -\){k 2 g^ v -kV) + ... , 


(65.5) 


where the ellipses stand for higher-order (in e 2 and/or A) terms. We can 
set Zi = 1 + 0(e 2 , A) in the first two terms. 

It will prove convenient to combine these first two terms into 


in'*" (A:) 


=2 /' d 4 £ N^_ 

J (27 r) 4 ((£+k) 2 + m 2 ){£ 2 + m 2 ) 

-i(Z 3 -l)(k 2 g^-kV) + ... , 


(65.6) 
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where 

= (2 1 + k)^(2£ + k) v - 2 \{i+k) 2 + m 2 }g ^ . (65.7) 

Then we continue to d dimensions, replace e with e/f £//2 , and combine the 
denominators with Feynman’s formula; the result is 


ili^^k) = e 2 g £ [ dx 
Jo 


1 r d d q N fiu 

dX J Wy ( q 2 + D) 2 


- i(Zi-l){k 2 g^ u - kV) + 


(65.8) 


where q = l + xk and D = x(A— x)k 2 + m 2 . The numerator is 

= (2q + (1— 2x)kY(2q + (1— 2x)k) u — 2[{q + (1— x)k) 2 + m 2 ]g ,J ' l/ 

= 4 q^y + (1—2 x) 2 k^ L k v — 2 [q 2 + (1— x) 2 k 2 + m 2 ]g^' 1 ' 

+ (linear in q) 

— > Ad^g^q 2 + (1—2 x) 2 k^k u — 2[g 2 + (1— x) 2 fc 2 + m 2 ]g^ 11 ' , (65.9) 

where we used the symmetric-integration identity from problem 14.3 to get 
the last line. We can rearrange eq. (65.9) into 

= 2(| - 1 )</V + (1-2x) 2 W - 2[(1 -x) 2 k 2 +m 2 ]g^ . (65.10) 

Now we recall from section 62 that, when q 2 is integrated against (q 2 +D)~ 2 , 
we can make the replacement (4 — 1 )q 2 — > D ; thus we have 


N» v -> 2D^ M/ + (l—2x) 2 k fJ, k u - 2[(l-x) 2 fc 2 + m 2 ]^ 
= (l-2x) 2 Fr - 2(l-2z)(l -x)k 2 g^ . 


(65.11) 


Next we note that if we make the change of variable x = y + then we 
have D = (1 — |y 2 )/c 2 + ?n 2 , and y is integrated from — J to + J. Therefore, 
any term in IV 7 "' that is even in y will integrate to zero. We then get 


TV'"' = Ay 2 k^k u - 2{2y 2 -y)k 2 g^ 
-»• - Ay 2 [k 2 g t"' - kPh?) . 


(65.12) 


Thus we see that IF"' (A;) is transverse, as expected. 

Performing the integral over q in eq. (65.8), and focusing on the diver- 
gent part, we get 


d q — = — - + O(e 0 ) 

(2n) d ( q 2 + D ) 2 87t 2 e 


(65.13) 
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Figure 65.1: The one- loop and counterterm corrections to the photon prop- 
agator in scalar electrodynamics. 



Figure 65.2: The one-loop and counterterm corrections to the scalar prop- 
agator in scalar electrodynamics. 

Then performing the integral over y yields 

rl/2 

/ dy N IJjy = - \{k 2 g ^ - k^k v ) . (65.14) 

J- 1/2 

Combining eqs. (65.8), (65.13), and (65.14), we get 

W v {k) = U{k 2 ){k 2 g ^ - k IJ k v ) , (65.15) 

where 

e 2 1 

U(k 2 ) = - — — r - + finite - (Z 3 - 1) + . . . . (65.16) 

247T z £ 

Thus we find, in the MS scheme, 

= + < 6517 > 


Now we turn to the one-loop corrections to the scalar propagator, shown 
in fig. (65.2). It will prove very convenient to work in Lorenz gauge, where 
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the photon propagator is 

VW = > ( 65 - 18 ) 


with P/,m(L) = The diagrams in fig. (65.2) then yield 


m^k 2 ) = {iz ie ) 2 (if 


dH P^(£)(£ + 2kr(£ + 2k) u 

(27 r) 4 £ 2 {{£+k) 2 + m 2 ) 


+ (—2 iZ 4 e 2 g^ 


dH P^i) 

(27t) 4 i 2 + m 2 


+ 


dH 1 

(27t) 4 £ 2 + m 2 


— i(Z2~ l)k 2 — i(Z m — l)m 2 + . . . . (65.19) 


In the second line, m 7 is a fictitious photon mass; it appears here as an 
infrared regulator. 

We can set Z) = 1 + 0(e 2 , A) in the first three lines. Continuing to d 
dimensions, making the replacements e — > eg 8 / 2 and A — > A/2 e , and using 
the relations £J'P IIV (£) = £ U P^{£) = 0 and g^ P^ IU (£) = d — 1, we get 


iU v (k 2 ) 


4e 2 fi £ J 
— 2(d—l)e 2 p £ J 


dH P^{£)Wk 


(2n) d £ 2 ((£+k) 2 + m 2 ) 


dH 1_ 

(2ir) d £ 2 + m 2 



dH 

(27t) 4 t 2 + m 2 


- i(Z- 2 -l)k 2 - i(Z m -l)m 2 + . . . . 


(65.20) 


We evaluate the second and third lines via 




£ 


dH i_ 

(2ir) d £ 2 + m 2 


sM"* 2+0 < £ °> 


(65.21) 


Then, taking the limit m 2 — > 0 (with e fixed) in eq. (65.21) shows that the 
second line of eq. (65.20) vanishes when the infrared regulator is removed. 

To evaluate the first line of eq. (65.20), we multiply the numerator and 
denominator by l 2 and use Feynman’s formula to get 


r dH £ 2 P^{£)¥ l k v r r d d g N 
J (2ir) d £ 2 £ 2 ((£+k) 2 + m 2 ) J 3 J (2ir) d (q 2 + D) 3 ’ 


(65.22) 
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Figure 65.3: The one-loop corrections to the three-point vertex in scalar 
electrodynamics . 


where q = £ + x%k, D = 2 : 3 ( 1 — x%)k 2 + 2:3 ?n 2 , and 
N = fk 2 - {i-kf 

= (q — 2:3 k) 2 k 2 — ( q-k — X3 k 2 ) 2 
= q 2 k 2 — {q-k) 2 + (linear in q) 

— > t/ 2 A ; 2 — d~ 1 q 2 k 2 . (65.23) 


Now we use 






1 


+ O(e 0 ) . 


(65.24) 


(2ir) d {q 2 + D ) 3 87 t 2 e 

Combining eqs. (65.20-65.24), and requiring 11^ (/c 2 ) to be finite, we find 


„ , 3e 2 1 

22 “ 1+ 8^I + '" 
Z ™ = 1+ 8^I + '" 


(65.25) 

(65.26) 


in the MS scheme. 

Now we turn to the one-loop corrections to the three-point (scalar- 
scalar-photon) vertex, shown in fig. (65.3). In order to simplify the calcu- 
lation of the divergent terms as much as possible, we have chosen a special 
set of external momenta. (If we wanted the complete vertex function, in- 
cluding the finite terms, we would need to use a general set of external 
momenta.) We take the incoming scalar to have zero four-momentum, and 
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the photon (treated as incoming) to have four-momentum k: then, by mo- 
mentum conservation, the outgoing scalar also has four-momentum k. We 
take the internal photon to have four-momentum l. 

Now comes the magic of Lorenz gauge: in the second and third diagrams 
of fig. (65.3), the vertex factor for the leftmost vertex is ie-P, and this is 
zero when contracted with the P pv (l) of the internal photon propagator. 
Thus the second and third diagrams vanish. 

Alas, we will have to do some more work to evaluate the first and fourth 
diagrams. We have 


*V£(fc, 0) = ieZ x k p 


+ (iZie)(-2iZ 4 e 2 S^)(j) 


dH P up (£)(£ + 2k)P 
(27 r) 4 t 2 {(l+k) 2 + m 2 ) 


+ (~iZx\)(iZie)(j) 


dH (21 + kY 

(27 r) 4 (l 2 + m 2 )((£+k) 2 + m 2 ) 


+ ... . 


(65.27) 


We can set Z. t = 1 + 0(e 2 , A) in the second and third lines. We then do the 
usual manipulations; the integral in the third line becomes 



d d q 2q fi + (1 — 2 x)k p 
(2-K) d (q 2 + D ) 2 


(65.28) 


where q = i + xk and D = x(l—x)k 2 + m 2 . The term linear in q vanishes 
upon integration over q, and the term linear in k vanishes upon integration 
over x. Thus the third line of eq. (65.27) evaluates to zero. 

To evaluate the second line of eq. (65.27), we note that, since P vp (P)t p = 
0, it already has an overall factor of k. We can then treat k as infinitesi- 
mal, and set k = 0 in the denominator. We can then use the symmetric- 
integration identity to make the replacement lyd.p/1 2 — > d~ 1 g up in the nu- 
merator. Putting all of this together, and using eq. (65.13), we find 


Q p 2 1 

V£(fc, 0)/e = Z x kP - ^ - k p + O(s 0 ) + . . . , (65.29) 

and so 

3e 2 1 

= 1 + T~2 ~ + ' ' ' ’ ( 65 ' 3 °) 

87 r z e 

in the MS scheme. 

Next up is the four-point, scalar-scalar-photon-photon vertex. Because 
the tree-level vertex factor, —2iZ^e 2 g^ 1 ', does not depend on the external 
four-momenta, we can simply set them all to zero. Then, whenever an 
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Figure 65.4: The nonvanishing one-loop corrections to the scalar-scalar- 
photon-photon vertex in scalar electrodynamics (in Lorenz gauge with van- 
ishing external momenta). 


internal photon line attaches to an external scalar with a three-point vertex, 
the diagram is zero, for the same reason that the second and third diagrams 
of fig. (65.3) were zero. This kills a lot of diagrams; the survivors are shown 
in fig. (65.4). We have 


iVf (0,0,0) 


-2 iZ A e 2 g^ u 


+ (— 2 iZ t ey(Cf 


(27 r) 4 £ 2 (£ 2 + m 2 ) 






d 4 i (2ty{2£y ( ( 

(2^)4 (. e 2 + m 2 ) 3 + 


+ (-iZ x X)(-2iZ 4 e 2 g^ 


dH 1 

(27 r) 4 (£ 2 + m 2 ) 2 


+ ... . 


(65.31) 


The notation + (//<-> z/) in the second and third lines means that we must 
add the same expression with these indices swapped; this is because the 
original and swapped versions of each diagram are topologically distinct, 
and contribute separately to the vertex function. 

As usual, we set Zi = 1 + 0(e 2 , A) in the second through fourth lines. 
After using the symmetric-integration identity, along with eqs. (65.13) and 
(65.24), we can see that the divergent parts of the third and fourth lines 
cancel each other. The first line is easily evaluated with symmetric inte- 
gration and eq. (65.13). Then we have 

Q p 2 1 

vf (0, 0, 0)/e 2 = -2 Z A gr + 4^2 ~ sT + O(e 0 ) + (65.32) 


Z 4 


3e 2 1 

1 + 8^I + "' 


and so 


(65.33) 
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Figure 65.5: The nonvanishing one- loop corrections to the four-scalar ver- 
tex in scalar electrodynamics (in Lorenz gauge with vanishing external mo- 
menta) . 


in the MS scheme. 

Finally, we have the one-loop corrections to the four-scalar vertex. Once 
again, because the tree-level vertex factor, —iZ\ A, does not depend on the 
external four-momenta, we can set them all to zero. Then, whenever an 
internal photon line attaches to an external scalar with a three-point vertex, 
the diagram is zero. The remaining diagrams are shown in fig. (65.5). 

Even though we have set the external momenta to zero, we still have 
to keep track of which particle is which, in order to count the diagrams 
correctly; thus the external lines are labelled 1 through 4. Lines 1 and 2 have 
arrows pointing towards their vertices, and 3 and 4 have arrows pointing 
away from their vertices. The symmetry factor for each of the first three 
diagrams is S = 2; for each of the last two, it is S' = 1. The difference arises 
because the last two diagrams have the charge arrows pointing in opposite 
directions on the two internal propagators, and so these propagators cannot 
be exchanged. 

It is clear that the first two diagrams will yield identical contributions 
to the vertex function (when the external momenta are all zero). Similarly, 
except for symmetry factors, the contributions of the last three diagrams 
are also identical. Thus we have 


iV 4¥ , (0,0,0) 


-iZ\ A 

+ (l + l) (-2*^4 e 2 ) 2 (y) 


dH g^P up (l)gP a P pfl (l) 

(27t) 4 (l 2 + m 2 ) 2 



65: Loop Corrections in Scalar Electrodynamics 


394 


+ (i + 1 + i) (-?.Z A A) 2 (i) 

+ ... . 


dH 1 

(27 r) 4 (£ 2 + m 2 ) 2 

(65.34) 


Using the familiar techniques, we find 

3e 4 1 5A 2 1 

V4 v (0, 0, 0) = —iZ\ A + — ^ + • • • ; (65.35) 

and so 

Zi = 1 + (^A + T^)u ■" (6536) 

in the MS scheme. 


Problems 

65.1) What conditions should be imposed on V 3 (jA/j) and (k,p' ,p) 
in the OS scheme? (Here k is the incoming four-momentum of the 
photon at the p vertex, and p' and p are the four-momenta of the 
outgoing and incoming scalars, respectively.) 

65.2) Consider a gauge transformaton A ^ —> A M — d^T. Show that there 
is a transformation of p that leaves the lagrangian of eqs. (65.1-65.4) 
invariant if and only if Z 4 = Z 4 /Z 2 . 
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66 Beta Functions in Quantum 
Electrodynamics 

Prerequisite: 52, 62 

In this section we will compute the beta function for the electromagnetic 
coupling e in spinor electrodynamics and scalar electrodynamics. We will 
also compute the beta function for the y? 4 coupling A in scalar electrody- 
namics. 

In spinor electrodynamics, the relation between the bare and renormal- 
ized couplings is 

e 0 = Z 3 " 1/2 Z 2 - 1 Zi /} £/2 e . (66.1) 

It is convenient to recast this formula in terms of the fine-structure constant 
a = e 2 1 At: and its bare counterpart «o = e o/47r, 


a o - 

= Z^Z^ 2 ZU/a. 

(66.2) 

From section 62, we have 



Zi = 


(66.3) 

Z 2 = 


(66.4) 

Z 3 = 

2 a 1 _ , Ox 

1 - b O(o ) , 

37T £ 

(66.5) 


in the MS scheme. Let us write 

In (zyzyz?) = £ AL) . (66.6) 

71=1 £ 

Then we have 

In o?o = ^ n [ l — + In a + e In fl . (66.7) 

71=1 £ 

From eqs. (66.3-66.5), we get 

E\(a) = b 0(a 2 ) . (66.8) 

37T 

Then, the general analysis of section 28 yields 

/ 3(a) = a 2 E[(a) , (66.9) 

where the prime denotes differentiation with respect to a. Thus we find 
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in spinor electrodynamics, We can, if we like, restate this in terms of e as 

0 ( e ) = -^2 + °( e5 ) ' ( 66 . 11 ) 

To go from eq. (66.10) to eq. (66.11), we use a = e 2 /47r and a = ee/27r, 
where the dot denotes d/dln ji. 

The most important feature of either eq. (66.10) or eq. (66.11) is that 
the beta function is positive: the electromagnetic coupling in spinor elec- 
trodynamics gets stronger at high energies, and weaker at low energies. 

It is easy to generalize eqs. (66.10) and (66.11) to the case of N Dirac 
fields with electric charges Qie. There is now a factor of for each field, 
and of Z\i for each interaction. These are found by replacing a in eqs. (66.3) 
and (66.4) with Q 2 a. Then we find Z\ijZn = 1 + 0(a 2 ), so that this ratio 
is universal, at least through O(a). In fact, as we will see in section 67, 
Z\ifZii is always exactly equal to one, and so it always cancels in eq. (66.6). 
As for Z3, now each Dirac field contributes separately to the fermion loop 
in the photon self-energy, and so we should replace a in eq. (66.5) with 
J2i Q\ol. Thus we find that the generalization of eq. (66.11) is 

0(e) = Sr2 g3 + Q(e5) • (66 ' 12) 

Now we turn to scalar electrodynamics. (Prerequisite: 65.) The rela- 
tions between the bare and renormalized couplings are 

e 0 = Z~ XI ' 2 Z~ X Z X jl £/2 e . (66.13) 

el = Z^ 2 Z~ 2 Z 4 jl £ e 2 . (66.14) 

Ao = Z~ 2 Z\p, £ X . (66.15) 

We have two different relations between e and eo, coming from the two 
types of vertices. We can guess (and will demonstrate in section 67) that 
these two renormalizations must work out to give the same answer. Indeed, 
from section 65, we have 


Zi 

Z 2 


1 3e 2 1 

1 + 8?I + " 


3e 2 1 

1 + 8^I + " 


Z3 — 1 — 


e 2 1 
247T 2 £ ^ 


(66.16) 

(66.17) 


(66.18) 
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„ , 3e 2 1 

Zt ~ 1+ 8^1 + --- ' 

(66.19) 

„ , / 3e 4 5A \ 1 

Za - 1 + (_2^A + 16ir 2 j e + ' ■ ■ ’ 

(66.20) 

in Lorenz gauge in the MS scheme; the ellipses stand for higher powers of 
e 2 and/or A. We see that Z\ = Z 2 = Z4, at least through 0(e 2 ,A). The 
correct guess is that this is true exactly. Thus eqs. (66.13) and (66.14) both 
collapse to eo = Z 3 1/<2 e, just as in spinor electrodynamics. 

Thus we can write 

ln ( z -‘ /2 ) = g EJ0) _ 

n = 1 

(66.21) 

, n ( z - 2 z A ) = £ LLh , 

71=1 

(66.22) 

Then we have 


lne 0 = n + Ine + \elnjj, , 

n=l £ 

(66.23) 

In Ao = ^ " - n — - + In A + e In fl . 

n=l 6 

(66.24) 

Using eqs. (66.17), (66.18) and (66.20), we have 


^ l(e,A) “ 24vr 2 + ”' ’ 

(66.25) 

Ti(e, A) — 2 24e 12e ) T • • • 

, (66.26) 

Now applying the general analysis of section 52 yields 


e 3 

&(e,A) - 48 ^ 2 + ... , 

(66.27) 

/?A ( e 5 — i 67r 2 (5A 6Ae + 24e j + . . . . 

(66.28) 

Both right-hand sides are strictly positive, and so both e and A become 
large at high energies, and small at low energies. 

Generalizing eq. (66.25) to the case of several complex scalar fields with 
charges Qie works in the same way as it does in spinor electrodynamics. 
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For a theory with both Dirac fields and complex scalar fields, the one-loop 
contributions to Z3 simply add, and so the beta function for e is 

/5 e (e, A) = Q% + k + • • • • (66.29) 

Problems 

66.1) Compute the one- loop contributions to the anomalous dimensions of 
m , T, and A ^ in spinor electrodynamics in Feynman gauge. 

66.2) Compute the one-loop contributions to the anomalous dimensions of 
m, <p, and A M in scalar electrodynamics in Lorenz gauge. 

66.3) Use the results of problem 62.2 to compute the anomalous dimension 
of m and the beta function for e in spinor electrodynamics in R ^ 
gauge. You should find that the results are independent of £. 

66.4) The value of a(M w ). The solution of eq. (66.12) is 

777 T = 7“T ~ HM w /p) , (66.30) 

a{M w ) a[/a) dir “ 

where the sum is over all quarks and leptons (each color of quark 
counts separately), and we have chosen the lL' r± boson mass M w as 
a reference scale. We can define a different renormalization scheme, 
modified decoupling subtraction or DS, where we imagine integrating 
out a field when p is below its mass. In this scheme, eq. (66.30) 
becomes 

777 t = - 7 -t - X) Qi ln[M w /min(mi, p)} , (66.31) 

a(M w ) a(p) 3i r “ 

where the sum is now over all quarks and leptons with mass less than 
M w . For p, < m e , the DS scheme coincides with the OS scheme, and 
we have 

d T T = - - 5Z Qi In , (66.32) 

a(M w ) a 6ir 

where a = 1/137.036 is the fine-structure constant in the OS scheme. 
Using m u = rn ( i = rn s ~ 300 MeV for the light quark masses (because 
quarks should be replaced by hadrons at lower energies), and other 
quark and lepton masses from sections 83 and 88, compute a(M w ). 
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67 Ward Identities in Quantum 
Electrodynamics I 

Prerequisite: 22, 59 

In section 59, we assumed that scattering amplitudes would be gauge in- 
variant, in the sense that they would be unchanged if we replaced any 
photon polarization vector with + ck where k ^ is the photon’s 
four-momentum and c is an arbitrary constant. Thus, if we write a scat- 
tering amplitude T for a process that includes an external photon with 
four-momentum k ^ as 

T = , (67-1) 

then we should have 

kfMp = 0 . (67.2) 

In this section, we will use the Ward identity for the electromagnetic current 
to prove eq. (67.2). 

We begin by recalling the LSZ formula for scalar fields, 

(f\i) = i J d 4 x i e~ lklXl (— <9 2 + m 2 ) . . . (0|T</?(aq) . . . |0) . (67.3) 

We have treated all external particles as outgoing; an incoming particle has 
k % <0. We can rewrite eq. (67.3) as 

(f\i) = lim (k\ + m 2 ) . . . (0|T</$(&i) . . . |0) . (67.4) 

m 2 

X 

Here (p{k) = i f d 4 x e~ lkx <p(x) is the field in momentum space (with an 
extra factor of i), and we do not fix k 2 = —m 2 . 

We know that the right-hand side of eq. (67.4) must include an overall 
energy-momentum delta function, so let us write 

<0|T£(fci)...|0) = (2 tt ) 4 5 4 {Y ji h)Hklk i -k j ) , (67.5) 

where lF(k 2 ,ki-kj) is a function of the Lorentz scalars k 2 and ki-kj. Then, 
since 

(f\i) = i(27r) 4 6 4 (Y:ik l )T , (67.6) 

eq. (67.4) tells us that T should have a multivariable pole as each k'f ap- 
proaches — m 2 , and that iT is the residue of this pole. That is, near 
k 2 = — m 2 , T takes the form 

*r 

( k 2 + m 2 ) . . . [kl + m 2 ) 


E(k 2 , ki-kj) 


+ nonsingular . 


(67.7) 
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The key point is this: contributions to T that do not have this multivariable 
pole do not contribute to T. 

We have framed this discussion in terms of scalar fields in order to keep 
the notation as simple as possible, but the general point holds for fields of 
any spin. 

In section 22, we analyzed how various classical field equations apply 
to quantum correlation functions. For example, we derived the Schwinger- 
Dyson equations 

s s 

(°I t a - , ( \ . . . <?WOn) |0) 

d(pa{x) 

n 

= i^(O|T0 ai (.xi) . . .d aaj 6 4 (x-Xj) . . . <t>a n (x n ) |0) . (67.8) 

3 = 1 

Here we have used 4> a (x) to denote any kind of held, not necessarily a 
scalar held, carrying any kind of index or indices. The classical equation 
of motion for the held (j) a {x ) is 5S/5f) a ( x) = 0. Thus, eq. (67.8) tells us 
that the classical equation of motion holds for a held inside a quantum 
correlation function, as long as its spacetime argument and indices do not 
match up exactly with those of any other held in the correlation function. 
These matches, which constitute the right-hand side of eq. (67.8), are called 
contact terms. 

Suppose we have a correlation function that, for whatever reason, in- 
cludes a contact term with a factor of, say, <5 4 (.t i— £ 2). After Fourier- 
transforming to momentum space, this contact term is a function of k\ + k 2 , 
but is independent of k\ — hence it cannot take the form of the singular 
term in eq. (67.7). Therefore, contact terms in a correlation function T do 
not contribute to the scattering amplitude T . 

Now let us consider a scattering process in quantum electrodyamics that 
involves an external photon with four-momentum k. In Lorenz gauge (the 
simplest for this analysis), the LSZ formula reads 

</|*> = ^ J d 4 xe- ikx (-d 2 ) . . . (OlTA^z) . . . |0) , (67.9) 

and the classical equation of motion for A ^ is 

BE 

- Z ^ = dM- (67 ' 10) 

In spinor electrodynamics, the right-hand side of eq. (67.10) is Z ij M , where 
j ,L is the electromagnetic current. (This is also true for scalar electrodynam- 
ics if Z4 = Z4/Z2; we saw in problem 65.2 that this condition is necessary 
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for gauge invariance.) We therefore have 

(f\i) = iZ^ 1 Z 1 e^ J d A xe~ lkx . . . (0|Tj /i (x) . . . |0) + contact terms . 

(67.11) 

The contact terms arise because, as we saw in eq. (67.8), the classical equa- 
tions of motion hold inside quantum correlation functions only up to contact 
terms. However, the contact terms cannot generate singularities in the k 2 's 
of the other particles, and so they do not contribute to the left-hand side. 
(Remember that, for each of the other particles, there is still an appropriate 
wave operator, such as the Klein-Gordon wave operator for a scalar, acting 
on the correlation function. These wave operators kill any term that does 
not have an appropriate singularity.) 

Now let us try replacing in eq. (67.11) with AT. We are attempting 
to prove that the result is zero, and we are almost there. We can write the 
factor of ik ^ as — cT acting on the e~ lkx , and then we can integrate by parts 
to get d M acting on the correlation function. (Strictly speaking, we need a 
wave packet for the external photon to kill surface terms.) Then we have 
cT(0|T jn(x) . . . |0) on the right-hand side. Now we use another result from 
section 22, namely that a Noether current for an exact symmetry obeys 
d^jn = 0 classically, and 

cT(0|T . . . |0) = contact terms (67.12) 

quantum mechanically; this is the Ward (or Ward-Takahashi ) identity. 
But once again, the contact terms do not have the right singularities to 
contribute to (f\i). Thus we conclude that (f\i) vanishes if we replace an 
external photon’s polarization vector with its four-momentum AT, quo 
erat demonstratum. 

Reference Notes 

Diagrammatic proofs of the Ward identity in spinor electrodynamics can 
be found in Peskin & Schroeder and Zee. 

Problems 




67: Ward Identities in Quantum Electrodynamics I 


402 


67.2) Show explicitly that the tree-level e + e 
in spinor electrodynamics, 


77 scattering amplitude 


/~j~~ 2 — 

T = e V 2 


-p 1 + W\ + m 


mi 2 — t 

vanishes if e y is replaced with k'( 1 ' 


iv + iy 


—v-i + Wo + m 


m- — u 


u i 
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68 Ward Identities in Quantum 
Electrodynamics II 

Prerequisite: 63, 67 

In this section, we will show that Z\ = Z 2 in spinor electrodynamics, and 
that Z\ = Z 2 = Za, in scalar electrodyanmics (in the OS and MS renormal- 
ization schemes). 

Let us specialize to the case of spinor electrodynamics with a single 
Dirac field, and consider the correlation function 

C^(k,p',p) = iZi I d A xd A yd i ze ikx - ip ' y+ipz {0\Tf(x)^ a (y)^ $ (z)\0) , 

( 68 . 1 ) 

where j 11 = e^'q^'I' is the electromagnetic current. As we saw in section 
67, including Z\j ll (x) inside a correlation function adds a vertex for an 
external photon; the factor of Z\ provides the necessary renormalization of 
this vertex. The explicit fermion fields on the right-hand side of eq. (68.1) 
combine with the fermion fields in the current to generate propagators. 
Thus we have 

C£p(k,p,p) = ( 2-K) A 5 A (k+p-p ) IS (pOiV^(p',p)IS(p) ^ , (68.2) 

where S (p) is the exact fermion propagator, and V^(j/,p) is the exact 1PI 
photon-fermion-fermion vertex function. 

Now let us consider k ll C^g(k,p' ,p). Using eq. (68.1), we can write the 
factor of ik ^ on the right-hand side as d ^ acting on e ikx , and then integrate 
by parts to get — d ^ acting on j M (x). (Strictly speaking, we need a wave 
packet for the external photon to kill surface terms.) Thus we have 

k,C^(k,p',p) = - j 0\Tf(x)^/ a (y)Wp(z)\0) . 

(68.3) 

Now we use the Ward identity from section 22, which in general reads 
-d fl (0\TJ fl (x)(j) ai (aq) . . . (j> an (x n ) |0) 

n 

= i^(0|T<(> ai (aq) . . . 6<f) aj (x)6 4 (x-x j ) . . . (j) an {x n )\D) . (68.4) 

3 = 1 


Here 5(j) a {x) is the change in a field 4> a (x) under an infinitesimal transfor- 
mation that leaves the action invariant, and 


J * 


dC 

d{d^(j) a ) 


Ha 


(68.5) 
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is the corresponding Noether current. In the case of spinor electrodynamics, 

= —ie\ V(x) , 

<54' (x) = +ie4'(x) , (68.6) 

where we have dropped an infinitesimal parameter on the right-hand sides, 
but included a factor of the electron charge e. Using 8C/d(d ^T) = 
and <9£/<9(<9 /i \h) = 0 we find that, with these conventions, the Noether 
current is = Z 2 J U Thus the Ward identity becomes 

-Z 2 d ll (0\Tj^x)* a (y)* /3 (z)\0) = +e<5 4 (*-y)(0|TT a (y)T /3 (z)|0> 

— e <5 4 (x— z)(0|T4' Q (?/)"T i g(2;)|0) . (68.7) 

Recall that 

(0|T4' Q ,(y)T / g(z)|0) = 7 / ^4 S(?W • (68-8) 

Using eqs. (68.7) and (68.8) in eq. (68.3), and carrying out the coordinate 
integrals, we get 

k » C ap( k ’P'iP) = ~iZ2 1 Z 1 (2ir) 4 8 4 (k+p-p') eS (p) - eS(p') ^ . (68.9) 

On the other hand, from eq. (68.2) we have 

KC» p {k,p',p) = -i(27r) 4 h 4 (fc+p-p / )[s(p / )^(p',p)S(p)] a/3 . (68.10) 

Comparing eqs. (68.9) and (68.10) shows that 

(p-p)m S(p')VV,p)S(p) = z^z ie [s(p) - so/)] , (68.11) 

where we have dropped the spin indices. We can simplify eq. (68.11) by 
multiplying on the left by S(p')^ 1 , and on the right by S(p) -1 , to get 

(P-P)^(P,P) = Z 2 - 1 Z 1 e[s(p / )“ 1 - S W 1 ] . (68.12) 

Thus we find a relation between the exact photon-fermion-fermion vertex 
function V /J (p ' , p) and the exact fermion propagator S (p) . 

Since both S (p) and V A( (p',p) are finite, eq. (68.12) implies that Z 1 /Z 2 
must be finite as well. In the MS scheme (where all corrections to Z* = 1 
are divergent), this immediately implies that 


Z\ — Z-2 • 


(68.13) 
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In the OS scheme, we recall that near the on-shell point p 2 = p' 2 = — m 2 
and {p' — p) 2 = 0 we have S (p) = jj> + m and V /4 (p' , p) = eyfo Plugging 
these expressions into eq. (68.12) then yields Z\ = Z2 for the OS scheme. 

To better understand this result, we note that when Z\ = Z 2 , we 
can combine the fermion kinetic term iZy^fy' T and the interaction term 
Z\el> JfEE into where D 11 = d p — ieA p is the covariant derivative. 

Recall that it is D p that has a simple gauge transformation, and so we 
might expect the lagrangian, written in terms of renormalized fields, to 
include d p and A p only in the combination D p . It is still necessary to go 
through the analysis that led to eq. (68.12), however, because quantization 
requires fixing a gauge, and this renders suspect any naive arguments based 
on gauge invariance. Still, in this case, those arguments yield the correct 
result. 

We can make a similar analysis in scalar electrodynamics. We leave the 
details to the problems. 


Reference Notes 

BRST symmetry (see section 74) can be used to derive the Ward identities; 
see Ramond I. 


Problems 


68.1) Consider the current correlation function ( 0\Tj fJ, (x)j u (y)\0 ) in spinor 
electrodynamics. 

a) Show that its Fourier transform is proportional to 

W w (k) + Il fip (k)Ap (T (k)Il <ri/ (k) + . . . . (68.14) 


b) Use this to prove that IP 1 ' (A;) is transverse: k fl W lu (k) = 0. 

68.2) Verify that eq. (68.12) holds at the one-loop level in any renormaliza- 
tion scheme with Z\ = Z-j- 

68.3) Scalar electrodynamics. (Prerequisite: 65.) 

a) Consider the Fourier transform of (0|TJ At (x)<^(y)</?^(2:)|0), where 

J p = — ieZ^X^ d p p — (d p <p*)(p\ — 2Zie 2 A^y^ ip (68.15) 

is the Noether current. You may assume that Z4 = Z 2 /Z2 (which is 
necessary for gauge invariance). Show that 


(p'-p)„V£(p',p) = Z, 


-1 


Z x e 


A(p') 1 - A (p) 


-1 


(68.16) 
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where V 3 (p',p) is the exact scalar-scalar-photon vertex function, and 
A (jp) is the exact scalar propagator. 

b) Use this result to show that Z\ = Z 2 in both the MS and OS 
renormalization schemes. 

c) Consider the Fourier transform of (0|T (x)A u (w)p(y)(p^ (z)\0) . 
Show that 


k^ u (k,p', P ) = zr 1 ; 


vW- k ,p)-vfo',p+ k ) 


(68.17) 


where V 4 v {k,p' ,p) is the exact scalar-scalar-photon-photon vertex 
function, with k the incoming momentum of the photon at the p 
vertex. 
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69 Nonabelian Gauge Theory 

Prerequisite: 24, 58 

Consider a lagrangian with N scalar or spinor fields fifix ) that is invariant 
under a continuous SU(IV) or SO(IV) symmetry, 

fifix) -> Uij(j)j{x) , (69.1) 

where Uij is an N x N special unitary matrix in the case of SU(IV), or an 
N x N special orthogonal matrix in the case of SO (IV). ( Special means 
that the determinant of U is one.) Eq. (69.1) is called a global symmetry 
transformation, because the matrix U does not depend on the spacetime 
label x. 

In section 58, we saw that quantum electrodynamics could be under- 
stood as having a local U(l) symmetry, 

4>(x) — > U{x)<j>{x) , (69.2) 

where U(x) = exp[— ieT{x)\ can be thought of as a 1 x 1 unitary matrix 
that does depend on the spacetime label x. Eq. (69.2) can be a symmetry 
of the lagrangian only if we include a U(1 ) gauge field A M (x), and promote 
ordinary derivatives d M of fi(x) to covariant derivatives D /t = — ieA 

Under the transformation of eq. (69.2), we have 

D^U{x)D^U\x) . (69.3) 

With this transformation rule, a scalar kinetic term like — (D^ip )^ or a 
fermion kinetic term like i'l'^'I', is invariant, as are mass terms like 
and mfl 1 . We call eq. (69.3) a gauge transformation , and say that the 
lagrangian is gauge invariant. 

Eq. (69.3) implies that the gauge field transforms as 

A^x) -> U{x)A^x)U^ (x) + | U(x)d fi U f (x) . (69.4) 

If we use U(x) = exp[— ieT(x)\, then eq. (69.4) simplifies to 

An(x) -»• A m (x) - d,jT(x) , (69.5) 

which is what we originally had in section 54. 

We can now easily generalize this construction of U(l) gauge theory 
to SU(IV) or SO (IV). (We will consider other possibilities later.) To be 
concrete, let us consider SU(IV). Recall from section 24 that we can write 
an infinitesimal SU(A^) transformation as 

U jk (x) = S jk - ig6 a (x)(T a ) jk + O(0 2 ) , 


(69.6) 
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where we have inserted a coupling constant g for later convenience. The 
indices j and k run from 1 to N, the index a runs from 1 to N' 2 — 1 (and 
is implicitly summed), and the generator matrices T a are hermitian and 
traceless. (These properties of T a follow immediately from the special uni- 
tarity of U.) The generator matrices obey commutation relations of the 
form 

[T°, T b ] = if abc T c , (69.7) 

where the real numerical factors f abc are called the structure coefficients of 
the group. If f abc does not vanish, the group is nonabelian. 

We can choose the generator matrices so that they obey the normaliza- 
tion condition 

TV (T a T b ) = \5 ab ; (69.8) 

then eqs. (69.7) and (69.8) can be used to show that f abc is completely 
antisymmetric. For SU(2), we have T a = \o a : where cr a is a Pauli matrix, 
and f abc = e abc , where e abc is the completely antisymmetric Levi-Civita 
symbol. 

Now we define an SU(IV) gauge field A^(x) as a traceless hermitian 
N X N matrix of fields with the gauge transformation property 

A^x) U{x)A lx {x)U\x) + | U(x)djj\x) . (69.9) 

Note that this is identical to eq. (69.4), except that now U(x) is a spe- 
cial unitary matrix (rather than a phase factor), and A^fx) is a traceless 
hermitian matrix (rather than a real number). (Also, the electromagnetic 
coupling e has been replaced by g.) We can write U(x) in terms of the 
generator matrices as 


U{x) = exp[-ic/r a (z)T a ] , (69.10) 

where the real parameters T a (x) are no longer infinitesimal. 

The covariant derivative is 

= dfj, - igA^(x) , (69.11) 

where there is an understood N xN identity matrix multiplying d^. Acting 
on the set of N fields <fi(x) that transform according to eq. (69.2), the 
covariant derivative can be written more explicitly as 

{D^jfx) = d^jix) - igA^x) jk (f k {x) , (69.12) 

with an understood sum over k. The covariant derivative transforms ac- 
cording to eq. (69.3). Replacing all ordinary derivatives in C with covariant 
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derivatives renders C gauge invariant (assuming, of course, that C originally 
had a global S\J(N) symmetry). 

We still need a kinetic term for A^x). Let us define the field strength 


F^{x) = ^[D^D V \ 

(69.13) 

= d^A u — d u A n ig[A^ t , A u ] . 

(69.14) 

Because is a matrix, the final term in eq. (69.14) does not vanish, as 

it does in U(l) gauge theory. Eqs. (69.3) and (69.13) imply that, under a 
gauge transformation, 

F^ix) - U(x)F^(x)U\x) . 

(69.15) 

Therefore, 

£kin = -|Tr (F^F^) 

(69.16) 

is gauge invariant, and can serve as a kinetic term for the SU(1V) gauge 
field. (Note, however, that the field strength itself is not gauge invariant, 
in contrast to the situation in U(l) gauge theory.) 

Since we have taken A^(x) to be hermitian and traceless, we can expand 
it in terms of the generator matrices: 

M*) = A » Ta • 

(69.17) 

Then we can use eq. (69.8) to invert eq. (69.17): 


A a ^x) = 2TrA lx (x)T a . 

(69.18) 

Similarly, we have 


F^x) = F; u T a , 

(69.19) 

F^{x) = 2Tr F, u T a . 

(69.20) 

Using eq. (69.19) in eq. (69.14), we get 


F^T C = (d»Al - d„Al)T c - igA“A b u [T a , T b } 


= (d^Al-d„Al + gr bc A«A b u )T c . 

(69.21) 

Then using eqs. (69.20) and (69.8) yields 


I%, = d li Ai-d v A c fl + gf obc AlA b v . 

(69.22) 

Also, using eqs. (69.19) and (69.8) in eq. (69.16), we get 


p _ _lpcfj,upc 

^km — 4 ^ L fu/ • 

(69.23) 
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From eq. (69.22), we see that C y in includes interactions among the gauge 
fields. A theory of this type, with nonzero f abc , is called nonabelian gauge 
theory or Yang-Mills theory. 

Everything we have just said about SU(1V) also goes through for SO(AT), 
with unitary replaced by orthogonal , and traceless replaced by antisym- 
metric. There is also another class of compact nonabelian groups called 
Sp(27V), and five exceptional compact groups: G(2), F(4), E(6), E(7), and 
E(8). Compact means that Tr (T a T b ) is a positive definite matrix. Non- 
abelian gauge theory must be based on a compact group, because otherwise 
some of the terms in £km would have the wrong sign, leading to a hamil- 
tonian that is unbounded below. 

As a specific example, let us consider quantum chromodynamics , or 
QCD, which is based on the gauge group SU(3). There are several Dirac 
fields corresponding to quarks. Each quark comes in three colors', these 
are the values of the SU(3) index. (These colors have nothing to do with 
ordinary color.) There are also six flavors: up, down, strange, charm, 
bottom (or beauty) and top (or truth). Thus we consider the Dirac field 
4',:/(x), where i is the color index and I is the flavor index. The lagrangian 
is 

C = - m/T/'F, - iTY(F^iV) , (69.24) 

where all indices are summed. The different quark flavors have different 
masses, ranging from a few MeV for the up and down quarks to 178 GeV 
for the top quark. (The quarks also have electric charges: +||e| for the u, 
c, and t quarks, and — ||e| for the d , s, and b quarks. For now, however, we 
omit the appropriate coupling to the electromagnetic field.) The covariant 
derivative in eq. (69.24) is 

(Djij = - igApfj • (69.25) 

The index a on A“ runs from 1 to 8, and the corresponding massless spin- 
one particles are the eight gluons. 

In a nonabelian gauge theory in general, we can consider scalar or spinor 
fields in different representations of the group. A representation of a com- 
pact nonabelian group is a set of finite-dimensional hermitian matrices 
(the R is part of the name, not an index) that obey that same commuta- 
tion relations as the original generator matrices T a . Given such a set of 
-D(R) x D( R) matrices (where D( R) is the dimension of the representa- 
tion), and a field <j>(x) with D{ R) components, we can write its covariant 
derivative as D /( = <9 M — igA^T£, with an understood D( R) x -D(R) identity 
matrix multiplying d^. Under a gauge transformation, 4>{x) — > U R {x)<j>{x), 
where U R (x) is given by eq. (69.10) with T a replaced by Tj“ . The theory 
will be gauge invariant provided that the transformation rule for is in- 
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depedent of the representation used in eq. (69.9); we show in problem 69.1 
that it is. 

We will not need to know a lot of representation theory, but we collect 
some useful facts in the next section. 

Problems 


69.1) Show that eq. (69.9) implies a transformation rule for that is in- 
dependent of the representation used in eq. (69.9). Hint: consider an 
infinitesimal transformation. 

69.2) Show that [T a T a ,T b ] = 0. 
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70 Group Representations 

Prerequisite: 69 

Given the structure coefficients f abc of a compact nonabelian group, a rep- 
resentation of that group is specified by a set of -D(R) x D( R) traceless 
hermitian matrices (the R is part of the name, not an index) that obey 
that same commutation relations as the original generators matrices T a , 
namely 

[T£,T b ]=if abc TC. (70.1) 

The number D( R) is the dimension of the representation. The original 
T a, s correspond to the fundamental or defining representation. 

Consider taking the complex conjugate of the commutation relations, 
eq. (70.1). Since the structure coefficients are real, we see that the matrices 
— (T“)* also obey these commutation relations. If — (Rt)* = T“, or if we can 
find a unitary transformation T“ — > U~ 1 T£U that makes — (T“)* = T“ for 
every a, then the representation R is real. If such a unitary transformation 
does not exist, but we can find a unitary matrix V j -I such that — (T“)* = 
V~ l TffV for every a, then the representation R is pseudoreal. If such a 
unitary matrix also does not exist, then the representation R is complex. 
In this case, the complex conjugate representation R is specified by 


T a = _ (T a r 


(70.2) 


One way to prove that a representation is complex is to show that 
at least one generator matrix T“ (or a real linear combination of them) 
has eigenvalues that do not come in plus-minus pairs. This is the case 
for the fundamental representation of S\J(N) with N > 3. For SU(2), 
the fundamental representation is pseudoreal, because — |cr“, 

but —{\cr a )* = V~ 1 (T}(j a )V with V = oi- For SO(iV), the fundamental 
representation is real, because the generator matrices are antisymmetric, 
and every antisymmetric hermitian matrix is equal to minus its complex 
conjugate. 

An important representation for any compact nonabelian group is the 
adjoint representation A. This is given by 


( T a A ) bc = - if 


abc 


(70.3) 


Because f abc is real and completely antisymmetric, T“ is manifestly her- 
mitian, and also satisfies eq. (70.2); thus the adjoint representation is real. 
The dimension of the adjoint representation D{ A) is equal to the number 
of generators of the group; this number is also called the dimension of the 
group. 
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To see that the T“’s satisfy the commutation relations, we use the Jacobi 
identity 


jabd jdce _|_ j?occi jcLae _|_ jcacL jcwe g 


cbcd rdae 


cad ndbe 


(70.4) 


which holds for the structure coefficients of any group. To prove the Jacobi 
identity, we note that 


TV T e ([[T a ,T h ],T c ] + [[T b ,T c ],T a ] + [[T c , T a ], T 6 ]) = 0 , (70.5) 

where the T“’s are the original generator matrices. That the left-hand 
side of eq. (70.5) vanishes can be seen by writing out all the commutators 
as matrix products, and noting that they cancel in pairs. Employing the 
commutation relations twice in each term, followed by 


Tr(T a T b ) = \5 ab , 


(70.6) 


ultimately yields eq. (70.4). Then, using the antisymmetry of the structure 
coefficients, inserting some judicious factors of i, and moving the last term 
of eq. (70.4) to the right-hand side, we can rewrite it as 


(- if abd )(-if cde ) - (~if cbd )(-if ade ) = if acd (—if dbe ) . 
Now we use eq. (70.3) in eq. (70.7) to get 

( T“) bd (Ti) de - (T c A ) bd {Tl) de = if acd (T d ) be: , 


(70.7) 


(70.8) 


or equivalently [T“,T£] = if acd T d . Thus the T“’s satisfy the appropriate 
commutation relations. 

Two related numbers usefully characterize a representation: the index 
T (R) and the quadratic Casimir C (R) . The index is defined via 

Tr (T£T*) = T{R)6 ab . (70.9) 


Next we recall from problem 69.2 that the matrix commutes with 

every generator, and so must be a number times the identity matrix; this 
number is the quadratic Casimir C(R). It is easy to show that 


T(R)D(A) = C(R)-D(R) . (70.10) 


With the standard normalization conventions for the generators, we have 
T(N) = -y for the fundamental representation of SU (N) and T(N) = 2 for 
the fundamental representation of SO (N). We show in problem 70.2 that 
T( A) = N for the adjoint representation of SU(1V), and in problem 70.3 
that T( A) = 2 N — 4 for the adjoint representation of SO (IV). 

A representation R is reducible if there is a unitary transformation — > 

U~ 1 T£U that puts all the nonzero entries into the same diagonal blocks for 
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each a; otherwise it is irreducible. Consider a reducible representation R 
whose generators can be put into (for example) two blocks, with the blocks 
forming the generators of the irreducible representations Ri and R 2 . Then 
R is the direct sum representation R = Ri © R 2 , and we have 

-D(Ri©R 2 ) = -D(Ri) + -D(R.2) , (70.11) 

T(Ri®R 2 ) = T(Ri) + T(R 2 ) . (70.12) 

Suppose we have a field (fu(x) that carries two group indices, one for 
the representation Ri and one for the representation R 2 , denoted by i and 
I respectively. This field is in the direct product representation Ri <g) R 2 . 
The corresponding generator matrix is 


= (TZJijSu + SijiTgJu , (70.13) 

where i and I together constitute the row index, and j and J together 
constitute the column index. We then have 

H(Ri<g>R 2 ) = D(R 1 )D{R 2 ) , (70.14) 

T(Ri®R 2 ) = T(R 1 )D(R 2 ) + D(Ri)T(R 2 ) . (70.15) 

To get eq. (70.15), we need to use the fact that the generator matrices are 

traceless, (T“)jj = 0. 

At this point it is helpful to introduce a slightly more refined notation for 
the indices of a complex representation. Consider a field ip in the complex 
representation R. We will adopt the convention that such a held carries 
a “down” index: pi, where i = 1, 2, . . . , D(R). Hermitian conjugation 
changes the representation from R to R, and we will adopt the convention 
that this also raises the index on the held, 

(<Pi? = <P U - (70.16) 

Thus a down index corresponds to the representation R, and an up index to 
R. Indices can be contracted only if one is up and one is down. Generator 
matrices for R are then written with the hrst index down and the second 
index up: (T^)^ . An infinitesimal group transformation of tpi takes the 
form 


<Pi - (1 -*0°7£)iVi 

= <Pi-ie a (T£)j<p j . (70.17) 

The generator matrices for R are then given by 

(T^Yi = -m/ , 


(70.18) 
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where we have used the hermiticity to trade complex conjugation for trans- 
position of the indices. An infinitesimal group transformation of tp'* takes 
the form 


^ (1 - i6 a T%) i j <pV 

= - i0 a {T£yj^ j 

= p ]i + *0°( , (70.19) 

where we used eq. (70.18) to get the last line. Note that eqs. (70.17) and 
(70.19) together imply that p' l g>i is invariant, as expected. 

Consider the Kronecker delta symbol with one index down and one up: 
8i J . Under a group transformation, we have 

Si j - (1 + i0 a T“) i fc (l + iPTgyfa 1 

= (i + ie a T^i k 8 k l (i-ie a T^y 

= 5i j + 0{6 2 ) . (70.20) 

Eq. (70.20) shows that 5^ is an invariant symbol of the group. This exis- 
tence of this invariant symbol, which carries one index for R and one for 
R, tells us that the product of the representations R and R must contain 
the singlet representation 1, specified by Tf = 0. We therefore can write 

R<g)R = 1® ... . (70.21) 

The generator matrix (T^)^ , which carries one index for R, one for R, 
and one for the adjoint representation A, is also an invariant symbol. To see 
this, we make a simultaneous infinitesimal group transformation on each of 
these indices, 

- (1 - i0°T“)i fc ( 1 - i9 a T£y t ( 1 - ie a Tl) bc (T£) k l 

= (T b n y - i0“[(T“)^(T*y + (T|)MT^ Z + (T a “) 6c (T^] 

+ 0(6> 2 ). (70.22) 

The factor in square brackets should vanish if (as we claim) the generator 
matrix is an invariant symbol. Using eqs. (70.3) and (70.18), we have 

[...] = (T“)>cr»y - (T R V(T*)/ -if abc (Tcy 

= (T“T*y - (T*T£)j - if abc (T*)i j 

= 0, 


(70.23) 
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where the last line follows from the commutation relations. The fact that 
is an invariant symbol implies that 

R<g>R<g>A = l©... . (70.24) 

If we now multiply both sides of eq. (70.24) by A, and use A 0 A = 1 © . . . 
[which follows from eq. (70.21) and the reality of A], we find R®R = A©. . . . 
Combining this with eq. (70.21), we have 

R®R = IffiA®... . (70.25) 

That is, the product of a representation with its complex conjugate is al- 
ways reducible into a sum that includes (at least) the singlet and adjoint 
representations. 

For the fundamental representation N of SU(IV), we have 

N ® N = 1 © A , (70.26) 

with no other representations on the right-hand side. To see this, recall that 
D{ 1) = 1, D( N) = D( N) = N, and, as shown in section 24, D( A) = N 2 — 1. 
From eq. (70.14), we see that there is no room for anything else on the 
right-hand side of eq. (70.26). 

Consider now a real representation R. From eq. (70.25), with R = R, 
we have 

R®R=1©A©.... (70.27) 

The singlet on the right-hand side implies the existence of an invariant 
symbol with two R indices; this symbol is the Kronecker delta 6^ . It is 
invariant because 

Sij - (1 - i0 a ^)i fc ( 1 - iO a T«)j l 5 kl 

= - iPKTfiij + (T£)ji\ + O(0 2 ) . (70.28) 

The term in square brackets vanishes by hermiticity and eq. (70.18). The 
fact that 8ij — 5. ji implies that the singlet on the right-hand side of eq. (70.28) 
appears in the symmetric part of this product of two identical representa- 
tions. 

The fundamental representation N of SO(IV) is real, and we have 

N <g> N = l s © A a © S s . (70.29) 

The subscripts tell whether the representation appears in the symmetric or 
antisymmetric part of the product. The representation S corresponds to 
a field with a symmetric traceless pair of fundamental indices: ip t j = ipji, 
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(fu = 0, where the repeated index is summed. We have D( 1) = 1, D (N) = 
N, and, as shown in section 24, D( A) = ^N(N-l). Also, a traceless 
symmetric tensor has D( S) = ^N(N+1)— 1 independent components; thus 
eq. (70.14) is fulfilled. 

Consider now a pseudoreal representation R. Since R is equivalent to its 
complex conjugate, up to a change of basis, eq. (70.27) still holds. However, 
we cannot identify 5jj as the corresponding invariant symbol, because then 
eq. (70.28) shows that R would have to be real, rather than pseudoreal. 
From the perspective of the direct product, the only alternative is to have 
the singlet appear in the antisymmetric part of the product, rather than 
the symmetric part. The corresponding invariant symbol must then be 
antisymmetric on exchange of its two R indices. 

An example (the only one that will be of interest to us) is the funda- 
mental representation of SU(2). For SU(-ZV) in general, another invariant 
symbol is the Levi-Civita tensor which carries N fundamental in- 

dices and is completely antisymmetric. It is invariant because, under an 
SU(IV) transformation, 


-11...1N 




in 


JN, 


-31— 3N 


= (det ,.i N ■ 


(70.30) 


Since det U = 1 for SU(IV), we see that the Levi-Civita symbol is invariant. 
We can similarly consider e n '" lAr , which carries N completely antisymmetric 
antifundamental indices. For SU(2), the Levi-Civita symbol is £{j = —£ji\ 
this is the two-index invariant symbol that corresponds to the singlet in the 
product 

2 <g) 2 = 1 A © 3 S , (70.31) 

where 3 is the adjoint representation. 

We can use e 13 and to raise and lower SU(2) indices. This is another 
way to see that there is no distinction between the fundamental represen- 
tation 2 and its complex conjugate 2. That is, if we have a field pi in the 
representation 2, we can get a field in the representation 2 by raising the 
index: ip 1 = £ lJ Pj- 

The structure constants f abc are another invariant symbol. This follows 
from (T“) ftc = — if abc , since we have seen that generator matrices (in any 
representation) are invariant. Alternatively, given the generator matrices 
in a representation R, we can write 

T{R)f abc = —i Tr(T“[T^, T^]) . (70.32) 


Since the right-hand side is invariant, the left-hand side must be as well. 
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If we use an anticommutator in place of the commutator in eq. (70.32), 
we get another invariant symbol, 

A(R)d abc = ±TY (T£{T b , T R C }) , (70.33) 

where A(R) is the anomaly coefficient of the representation. The cyclic 
property of the trace implies that A{R)d abc is symmetric on exchange of 
any pair of indices. Using eq. (70.18), we can see that 

A(R) = -A{R) . (70.34) 

Thus, if R is real or pseudoreal, R(R) = 0. We also have 

R(Ri 0R 2 ) = ^4(Ri) + R(R 2 ) , (70.35) 

A{ Ri 0R 2 ) = A(R 1 )i?(R 2 ) + D(R 1 )A{R 2 ) . (70.36) 

We normalize the anomaly coefficient so that it equals one for the smallest 
complex representation. In particular, for SV(N) with N > 3, the smallest 
complex representation is the fundamental, and R(N) = 1. For SU(2), all 
representations are real or pseudoreal, and 7l(R) = 0 for all of them. 

Reference Notes 

More group and representation theory can be found in Ramond II. 

Problems 


70.1) Verify eq. (70.10). 

70.2) a) Use eqs. (70.12) and (70.26) to compute T( A) for SU(IV). 

b) For SU(2), the adjoint representation is specified by (Tff) bc = 
—ie abc . Use this to compute T( A) explicitly for SU(2). Does your 
result agree with part (a)? 

c) Consider the SU(2) subgroup of SU(V) that acts on the first two 
components of the fundamental representation of SU (N). Under this 
SU(2) subgroup, the N of SU(IV) transforms as 2 0 (N— 2)l’s. Us- 
ing eq. (70.26), figure out how the adjoint representation of SU(V) 
transforms under this SU(2) subgroup. 

d) Use your results from parts (b) and (c) to compute T( A) for SU(fV). 
Does your result agree with part (a)? 
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70.3) a) Consider the SO(3) subgroup of SO(N) that acts on the first three 
components of the fundamental representation of SO(IV). Under this 
SO(3) subgroup, the N of SO(N ) transforms as 3 0 (N— 3)l’s. Us- 
ing eq. (70.29), figure out how the adjoint representation of SO(IV) 
transforms under this SO(3) subgroup. 

b) Use your results from part (a) and from problem 70.2 to compute 
T( A) for SO(JV). 

70.4) a) For SU(-ZV), we have 

N 0 N = A a © <S s , (70.37) 

where A corresponds to a field with two antisymmetric fundamental 
SU(iV) indices, p>ij = —pji, and S corresponds to a held with two 
symmetric fundamental SU(1V) indices, ptj = +Pji- Compute D(A) 
and D(S). 

b) By considering an SU(2) subgroup of SV(N), compute T(A) and 
T(S). 

c) For SU(3), show that .4 = 3. 

d) By considering an SU(3) subgroup of SU(1V), compute A(A) and 

MS). 

70.5) Consider a held p>i in the representation Ri and a held xi i n the 
representation R 2 . Their product PiXi is then in the direct product 
representation Ri 0 R 2 , with generator matrices given by eq. (70.13). 

a) Prove the distribution rule for the covariant derivative, 

[ D ^x)\u = (D^)iXi + ■ (70.38) 

b) Consider a held (pi in the complex representation R. Show that 

d^ l pi) = (D^yipi + tp fl (D lx p))i . (70.39) 

Explain why this is a special case of eq. (70.38). 

70.6) The held strength in Yang-Mills theory is in the adjoint representa- 
tion, and so its covariant derivative is 

{DpF^r = d p F; u - igA c p (T c A y b F» u . (70.40) 

Prove the Bianchi identity , 

{D p F up ) a + ( D v F pp ) a + ( D p F pv ) a = 0 . (70.41) 
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71 The Path Integral for Nonabelian Gauge 

Theory 

Prerequisite: 53, 69 

We wish to evaluate the path integral for nonabelian gauge theory (also 
known as Yang-Mills theory), 

Z(J) oc J VA pWYJ) 5 (71.1) 

Sym(A, J) = j d 4 x [- \F a ^F“ v + . (71.2) 

In section 57, we evaluated the path integral for U(l) gauge theory by 
arguing that, in momentum space, the component of the U(l) gauge field 
parallel to the four-momentum kJ l did not appear in the action, and hence 
should not be integrated over. This argument relied on the form of the 
U(l) gauge transformation, 

A^x) -*■ Afj(x) - 5 M r(x) . (71.3) 

In the nonabelian case, however, the gauge transformation is nonlinear, 

A^x) U{x)A^x)U\x) + ^U(x)d^u\x) , (71.4) 

where A /t (x') = A“(x)T a . For an infinitesimal transformation, 

U(x) = I — igO(x) + 0(9 2 ) 

= / — ig9 a (x)T a + O(0 2 ) , (71.5) 

we have 

An(x) ->• A m ( x) + ig[Afj,(x),6(x)\ - d^9(x) , (71.6) 

or equivalently 

A;(x) -+ A;(x) - gf abc A b l (x)9 c (x) - d,9 a (x) 

= A^-n + gf^AfaWix) 

= A%[x) - [5 ac d, L - igA b ^-if bac )}9 c (x) 

= Al{x)-[5 ac d,-igA b ^Tir]9 c {x) 

= A%{x) - D“ c 9 c (x) , (71.7) 

where D“ c is the covariant derviative in the adjoint representation. We 
see the similarity with the abelian case, eq. (71.3). However, the fact that 
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it is D n that appears in eq. (71.7), rather than means that we cannot 
account for gauge redundancy in the path integral by simply excluding the 
components of that are parallel to k/, . We will have to do something 
more clever. 

Consider an ordinary integral of the form 

Z oc J dx dy e iS{x) , (71.8) 

where both x and y are integrated from minus to plus infinity. Because y 
does not appear in S(x), the integral over y is redundant. We can then 
define Z by simply dropping the integral over y, 

Z = Jdxe iS{x) . (71.9) 


This is how we dealt with gauge redundancy in the abelian case. 

We could get the same answer by inserting a delta function, rather than 
by dropping the y integral: 

Z = J dx dy 6(y) e iS W . (71.10) 

Furthermore, the argument of the delta function can be shifted by an ar- 
bitrary function of x. without changing the result: 


Z= j dx dy S(y — f(x)) e tS ^ . (71.11) 

Suppose we are not given f(x) explicitly, but rather are told that y = f(x) 
is the unique solution, for fixed x, of G(x, y) = 0. Then we can write 


5(G(x,y)) 


<% ~ /(g)) 

\dG/dy\ 


(71.12) 


where we have used a standard rule for delta functions. We can drop the 
absoute- value signs if we assume that dG/dy is positive when evaluated at 
y = f(x). Then we have 


Z= [ dxdy^ 5{G) e iS 
J dy 


(71.13) 


Now let us generalize this result to an integral over d n x d n y. We will need 
n functions Gi(x,y) to fix all n components of y. The generalization of 
eq. (71.13) is 


Z = 


dGj 

. d Vi 




AS 


d n xd n y det 


(71.14) 
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Now we are ready to translate these results to path integrals over non- 
abelian gauge fields. The role of the redundant integration variable y is 
played by the set of all gauge transformations 6 a (x). The role of the inte- 
gration variables x and y together is played by the gauge field Af(x). The 
role of G is played by a gauge-fixing function. We will use the gauge-fixing 
function appropriate for R £ gauge, which is 

G a {x) = d»A“(x) - co a (x) , (71.15) 

where uj a (x) is a fixed, arbitrarily chosen function of x. (We will see how 
the parameter £ enters later.) In eq. (71.15), the spacetime argument x and 
the index a play the role of the index i in eq. (71.14). Our path integral 
becomes 

Z(J) oc J VA det(^) Yl x ,J(G) e iS ™ , (71.16) 

where Sym is given by eq. (71.2). 

Now we have to evaluate the functional derivative 5G a (x)/59 b (y), and 
then its functional determinant. From eqs. (71.7) and (71.15), we find that, 
under an infinitesimal gauge transformation, 

G a (x) G a (x) - d»D“ b d b (x) . (71.17) 

Thus we have 

= -<Wjy (z - „) , (71.18) 

where the derivatives are with respect to x. 

Now we need to compute the functional determinant of eq. (71.18). 
Luckily, we learned how to do this in section 53. A functional determi- 
nant can be written as a path integral over complex Grassmann variables. 
So let us introduce the complex Grassmann field c a (x), and its hermitian 
conjugate c a (x). (We use a bar rather than a dagger to keep the notation 
a little less cluttered.) These fields are called Faddeev-Popov ghosts. Then 
we can write 

det ^S0 b ( X ) rc J e lSsh , (71.19) 

where the ghost action is 5 g h = / d A x C g h, and the ghost lagrangian is 
£ gh = <fd^D ab c b 
= -d^(fiD ab c b 

fj, 

= -d^cfd ^ + gf abc A c ^<?c b . 


(71.20) 
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We dropped a total divergence in the second line. We see that c a (x ) has 
the standard kinetic term for a complex scalar field. (We need the factor of 
i in front of 5 g h in eq. (71.19) for this to work out; this factor affects only 
the overall phase of Z(J), and so we can choose it at will.) The ghost field 
is also a Grassmann field, and so a closed loop of ghost lines in a Feynman 
diagram carries an extra factor of minus one. We see from eq. (71.20) that 
the ghost field interacts with the gauge field, and so we will have such loops. 

Since the particles associated with the ghost field do not in fact exist 
(and would violate the spin-statistics theorem if they did), it must be that 
the amplitude to produce them in any scattering process is zero. This is 
indeed the case, as we will discuss in section 74. 

We note that in abelian gauge theory, where f abc = 0, there is no 
interaction term for the ghost field. In that case, it is simply an extra free 
field, and we can absorb its path integral into the overall normalization. 

We have one final trick to perform. Our gauge-fixing function, G a {x ) 
contains an arbitrary function u> a (x). The path integral Z(J) is, however, 
independent of LO a (x). So, we can multiply Z( J) by an arbitrary functional 
of u, and then perform a path integral over u; the result can change only 
the overall normalization of Z(J). In particular, let us multiply Z(J) by 


exp 



(71.21) 


Because of the delta- functional in eq. (71.16), it is easy to integrate over u. 
The final result for Z(J) is 


Z(J) oc J VAVcVc expos'™ + zS'gh + ^gf) > (71.22) 

where Sym is given by eq. (71.2), S g h is given by the integral over d 4 x of 
eq. (71.20), and S g f (gf stands for gauge fixing) is given by the integral over 
d A x of 

£gf = -IC'WAl&'Al . (71.23) 

In the next section, we will derive the Feynman rules that follow from 
this path integral. 
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72 The Feynman Rules for Nonabelian Gauge 

Theory 

Prerequisite: 71 

Let us begin by considering nonabelian gauge theory without any scalar or 
spinor fields. The lagrangian is 

l ym — 4-C r fxv 

= -\{d^A ev - d v A e » + g f abe A a ^A bu )(d p A e l , - ft, A* + gf^A^At) 

= -\d»A ev d p A e v + \& l A ev d v A e p 

- gf abe A atl A bu d^Al - \g 2 f abe f cde A a ' l A b ''A c tl Ai . (72.1) 

To this we should add the gauge-fixing term for R ^ gauge, 

£ gf = -lC 1 d^d I 'Al. (72.2) 

Adding eqs. (72.1) and (72.2), and doing some integrations-by-parts in the 
quadratic terms, we find 

Cym + £ g f = \A^{g^ v d 2 - d p d v )A ev + W^d^A™ 

- gf abc A a ^A bu d p A c u ~ \g 2 f abe f cde A a ^A bu A c ^A d v . (72.3) 

The first line of eq. (72.3) yields the gluon propagator in R % gauge, 

Ah*) = G - k ~w + t k ~w) ■ < 72 - 4 > 

The second line of eq. (72.3) yields three- and four-gluon vertices, shown in 
fig. (72.1). The three-gluon vertex factor is 

i^upiP^QA) = i(-gf abc )(-irp,gvp) 

+ [5 permutations of (a,/i,p), ( b,v,q ), (c,py) ] 

= gf abc [(q-r)vg„ P + {r-p)„g PP + ( p-q) P g P u } • (72.5) 

The four-gluon vertex factor is 

= -ig 2 f abe f cde gppgv« 

+ [5 permutations of (6,u), (c,p), ( d,o ) ] 

= -ig 2 [ f abe f cde {gp P gva - g^g^) 

+ f ace f dbe {gpag P u - gpvgpo) 

+ f ade f bce {g P vga P - gp P g<ru) ] 


(72.6) 
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Figure 72.1: The three-gluon and four-gluon vertices in nonabelian gauge 
theory. 

These vertex factors are quite a bit more complicated that the ones we 
are used to, and they lead to rather involved formulae for scattering cross 
sections. For example, the tree-level gg — ► gg cross section (where g is a 
gluon), averaged over initial spins and colors and summed over final spins 
and colors, has 12,996 terms! Of course, many are identical and the final 
result can be expressed much more simply, but this is no help to us at the 
initial stages of computation. For this reason, we postpone any attempt 
at tree- level calculations until section 81, where we will make use of some 
techniques (color ordering and Gervais-Neveu gauge) that, combined with 
spinor-helicity methods, greatly reduce the necessary labor. 

For loop calculations, we need to include the ghosts. The ghost la- 
grangian is 


£ g h = ~d^c b D b ;c c 

= -d^cfd^c 0 + igd f *ct , A“(TZ) bc c c 
= -d^d^c c + gf abc A^d ll c h c c . (72.7) 

The ghost propagator is 

rafe 

A ab {k 2 ) = -T- • (72.8) 

k z — ie 

Because the ghosts are complex scalars, their propagators carry a charge 
arrow. The ghost-ghost-gluon vertex shown in fig. (72.2); the associated 
vertex factor is 


iVf=(g,r) = i(gf abc )(-iq „) 

= gf abc % • 


(72.9) 
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Figure 72.2: The ghost-ghost-gluon vertex in nonabelian gauge theory. 
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Figure 72.3: The quark-quark-gluon vertex in nonabelian gauge theory. 

If we include a quark coupled to the gluons, we have the quark la- 
grangian 


£q = ,D jj'V j - ///T;Vl/, 

= iVifiBi - + gA°* i 'fT? j * j . (72.10) 

The quark propagator is 


Sij{p) 


{—]) + m)5jj 
p 2 + m 2 — ie 


(72.11) 


The quark-quark-gluon vertex shown in fig. (72.3); the associated vertex 
factor is 

iVf/ = itrfTfj ■ (72.12) 

If the quark is in a representation R other than the fundamental, then 7)“ 
becomes (T")^-. 


Problems 

72.1) Consider a complex scalar field ipi in a representation R of the gauge 
group. Find the vertices that involve this field, and the associated 
vertex factors. 
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73 The Beta Function in Nonabelian Gauge 

Theory 

Prerequisite: 70, 72 


In this section, we will do enough loop calculations to compute the beta 
function for the Yang-Mills coupling g. 

We can write the complete lagrangian, including Z factors, as 

C = l 2 Z 3 A a ' J (g lw d 2 - d^d u )A au + Ic^d^A™ 

- Z 3g g f abc A a, ‘A bl ' d tl A c v - \Z Ag g 2 f^T^A^Ai 

- Z 2 rd t ^C a d l ,C a + Z v gf abc A c ^ l C a C b 

+ iZ-AighV; - Z m m%*i + ZujA^rfT^F, . (73.1) 


Note that the gauge-fixing term in the first line does not need a Z factor; 
we saw in section 62 that the ^-dependent term in the propagator is not 
renormalized. 

We see that g appears in several places in £, and gauge invariance leads 
us to expect that it will renormalize in the same way in each place. If 
we rewrite C in terms of bare fields and parameters, and compare with 
eq. (73.1), we find that 


So = 


Z|Z 3 


sV = 


a 2 n £ = a 2 u £ = ^ a 2 u £ 

z 2 z 3 9 1 z \ 9 9 z \ 9 9 


7 2 

z i' 


(73.2) 


where d = 4 — e is the number of spacetime dimensions. To prove eq. (73.2), 
we have to derive the nonabelian analogs of the Ward identities, known as 
Slavnov- Taylor identities. For now, we simply assume that eq. (73.2) holds; 
we will return to this issue in section 74. 

The simplest computation to perform is the renormalization of the 
quark-quark-gluon vertex. This is partly because much of the calculation 
is the same as it is in spinor electrodynamics, and so we can make use of 
our results in section 62. We then must compute Z \ , Z- 2 , and Z 3 . We will 
work in Feynman gauge, and use the MS renormalization scheme. 

We begin with Z 3 - The 0(g 2 ) corrections to the fermion propagator 
are shown in fig. (73.1). These diagrams are the same as in spinor elec- 
trodynamics, except for the factors related to the color indices. The loop 
diagram has a color-factor of (T a T a )ij = C Y (R)<%. (Here we have allowed 
the quark to be in an arbitrary representation R; for notational simplicity, 
we will continue to omit the label R on the generator matrices.) In section 
62, we found that, in spinor electrodynamics, the divergent part of this 
diagram contributes —(e 2 /8Tr 2 e)tf to the electron self-energy S($f). Thus 
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Figure 73.1: The one-loop and counterterm corrections to the quark prop- 
agator in quantum chrorno dynamics. 
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Figure 73.2: The one-loop corrections to the quark-quark-gluon vertex in 
in quantum chromodynamics. 


in Yang-Mills gauge theory, the divergent part of this diagram contributes 
— (g 2 /8n 2 e)C (R)5ijtf to the quark self-energy This divergent term 

must be cancelled by the counterterm contribution of — (Z 2 — There- 
fore, in Yang-Mills theory, with a quark in the representation R, using 
Feynman gauge and the MS renormalization scheme, we have 


^2 




(73.3) 


Moving on to the quark-quark-gluon vertex, we the contributing one- 
loop diagrams are shown in fig. (73.2). The first diagram is again the same 
as it is in spinor electrodynamics, except for the color factor of (T b T a T b )ij . 
We can simplify this via 


rj-ibrjiCLrjib rpb _j_ j j^ClbCfJ-iC^ 

= C(R)T a + \if abc [T b ,T C ) 

= C(R)T a + ±(if abc )(if bcd )T a 
= C(R)T a - \{T“) bc {T d ) cb T d 


C( R) - \T{ A) 


(73.4) 


In the second line, we used the complete antisymmetry of f abc to replace 
T b T c with \[T b ,T c ]. To get the last line, we used Tr (T“Tf) = T(A)5 ad . In 
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section 62, we found that, in spinor electrodynamics, the divergent part of 
this diagram contributes (e 2 /87T 2 £)ie'y p to the vertex function iY p (p',p). 
Thus in Yang-Mills theory, the divergent part of this diagram contributes 


C( R) - ^T(A) 


8tt 2 £ 




(73.5) 


to the quark-quark-gluon vertex function i V“ /J ' (// , p) . This divergent term, 
along with any divergent term from the second diagram of fig. (73.2), must 
be cancelled by the tree-level vertex iZigy p T p . 

Now we must evaluate the second diagram of fig. (73.2). The divergent 
part is independent of the external momenta, and so we can set them to 
zero. Then we get a contribution to iV“^(0, 0) of 


(igfgf abc (T c T b ) 




(D7 


d 4 £ Ipi-f+mhv 


(27t) 4 £ 2 £ 2 {£ 2 +m 2 ) 

x [(M-7)V P + M-o )V M + (0 -£) p g^] 


(73.6) 


We can simplify the color factor with the manipulations of eq. (73.4), 


fabcrpcrj-ib 


- \f abc [T c ,T b ] 

1^ j?abcj?cbdrj-<d 

= -\iT(A)T a . 


(73.7) 


The numerator in eq. (73.6) is 


AT = 7 pi-yj* + m) r ) v (2l p 'g vp - £ v g pp - £ p g pv ) . (73.8) 

We can drop the terms linear in £, and make the replacement £° £ p — > 
d~ 1 £ 2 g ap . Thus we have 

N p _ d ~if ( 7 p 7 0 . 7j/ ) (2g ap g vp - g™g pp - g° p g pv ) 

- ~d ~ l £ 2 - i P Yr) 

-+ -d~ 1 £ 2 (2(d-2) + d + d)y p . (73.9) 


Because we are only keeping track of the divergent term, we are free to set 
d = 4, which yields 


N p -> -3£V • (73.10) 

Using eqs. (73.7) and (73.10) in eq. (73.6), we get 


d A £ 


1 


lT(A)g 3 T^ p 


(2i r) 4 £ 2 (£ 2 +m 2 ) 


(73.11) 
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Figure 73.3: The one-loop and counterterm corrections to the gluon prop- 
agator in quantum chromodynamics. 


After continuing to d dimensions, the integral becomes i/ 8 n 2 £ + O(e 0 ). 
Combining eqs. (73.5) and (73.6), we find that the divergent part of the 
quark-quark-gluon vertex function is 


V“f(0,0) div =[Z 1 + [c(R)-iT(A)l ^ + §T( A) 


8 n 2 £ 


8tt 2 £ 




Requiring V“^ (0, 0) to be finite yields 


(73.12) 


Zi = 1 - 


C(R) + T( A) 




(73.13) 


in Feynman gauge and the MS renormalization scheme. 

Note that we have found that Z\ does not equal Z 2 . In electrodynamics, 
we argued that gauge invariance requires all derivatives in the lagrangian 
to be covariant derivatives, and that both pieces of D jL = d jx — ieA^ should 
therefore be renormalized by the same factor; this then implies that Z\ 
must equal Z^. In Yang-Mills theory, however, this argument fails. This 
failure is due to the introduction of the ordinary derivative in the gauge- 
fixing function for R £ gauge: once we have added £ g f and £ g h to T Y m, we 
find that both ordinary and covariant derivatives appear. (This is especially 
obvious for £ g h.) Therefore, to be certain of what gauge invariance does and 
does not imply, we must derive the appropriate Slavnov-Taylor identities, 
a subject we will take up in section 74. 

Next we turn to the calculation of Z 3 . The 0(g 2 ) corrections to the 
gluon propagator are shown in fig. (73.3). The first diagram is proportional 
to / d A l/l 2 ] as we saw in section 65, this integral vanishes after dimensional 
regularization. 
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The second diagram yields a contribution to Hl puab (k) of 

1 2 f acd f bcd(lf [ dH 

2 9J 1 W J (2tt) 4 £ 2 {£+k) 2 ’ 

where the one-half is a symmetry factor, and 


(73.14) 


N pu = [{k+T)-{-T)Yg p(T + (-£-(-k)) p g ap + ((-k)-(k+£)y 7 g pp ] 

x[(-k-e)-ey gp<T + o t-k ) p 8 a u + (k-(-k-£)) a 8 u p ] 

= -[(2 l+kY l g pa - (£—k) p g ap - (£+2 k) u g pp ] 

x [(2 £+ky 9pa - (£—k) p 8 a u - (£+2 k) a 8%] (73.15) 

The color factor can be simplified via f acd f bcd = T(A)5 ab . We combine de- 
nominators with Feynman’s formula, and continue to d = 4 — e dimensions; 
we now have 

-i/u A)S <*gfy x f^ J jy w , ( 73 . 16 ) 

where D = x(l—x)k 2 and q = £ + xk. The numerator is 

N pu = —[(2q+(l—2x)k) p 'g pa — (q—(l+x)k) p g <7p — (q+(2— x)k) a g pp ] 
x[(2q+(l-2x)ky g pa - (q-(l+x)k) p S (T u - {q+(2-x)k) a 5 u p ] . 

(73.17) 


Terms linear in q will integrate to zero, and so we have 

N pv -> - 2 q 2 g pu - (4d-6)q p q u 

- [(1+x) 2 + (2-x) 2 ]k 2 g pv 

- [d(l-2xf + 2(l-2x)(l+x) 

- 2(2-x)(l+x) - 2(2-x)(l-2. x)]k p k v . (73.18) 

Since we are only interested in the divergent part, we can go ahead and set 
d = 4 in the numerator. We can also make the replacement q^q v —> \q 2 g pu ■ 
Then we find 

N pv — - | q 2 g pv - (5-2x+2 x 2 )k 2 g pv + (2+10x-10. x 2 )k p k u . (73.19) 

We saw in section 62 that, when integrated against ( q 2 + D)~ 2 , q 2 can be 
replaced with (| — 1 )~ 1 D; in our case this is — 2x(l— x)k 2 . This yields 

N IW -> - (5— llx+llx^&V*' + (2+10x-10 x 2 )k p k u . 


(73.20) 
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We now use 


h 


d d q 1 


( 2n) d ( q 2 + D ) 2 87r 2 e 


+ O(e 0 ) 


in eq. (73.16) to get 


167 r 2 


T(A)5 ab - f 1 dx N ^ + O(e 0 ) . 
e 4o 


Performing the integral over x yields 
_ ig2 

167 r 2 ' ' e 


T(A)<5 a6 - (-f k 2 g^ + A;") 


(73.21) 


(73.22) 


(73.23) 


as the divergent contribution of the second diagram to iW ,uab (k) . 

Next we have the third diagram of fig. (73.3), which makes a contribu- 
tion to iW LVab {k) of 


(-1 )g 2 f acd f bdc (\ 



dH (i+kyi v 
(2tt) 4 £ 2 {£+k) 2 ’ 


(73.24) 


where the factor of minus one is from the closed ghost loop. The color factor 
is f acd f bdc = — T(A)5 ab . After combining denominators, the numerator 
becomes 


{i+kyf = (q + ( 1 — x)kY{q — xk) u 
—■ ► \q 2 g^ v — x(l—x)k ,J ’k 1 ' 

— > — \x(l—x)k 2 g^ u — x(l— x)^^ . (73.25) 

We then use eq. (73.21) in eq. (73.24); performing the integral over x yields 

- T(A)5 ab \ (-^V" - (73.26) 

as the divergent contribution of the third diagram to iW wab (k). 

Finally, we have the fourth diagram. This is the same as it is in spinor 
electrodynamics, except for the color factor of TV (T a T b ) = T(R)5 ab . If 
there is more than one flavor of quark, each contributes separately, leading 
to a factor of the number of flavors n F . Then, using our results in section 
62, we find 

- n F T(R)S ab - f k 2 g llv - kW) (73.27) 

67T- £ v / 

as the divergent contribution of the fourth diagram to jTl ,luab (k). 
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Adding up eqs. (73.23), (73.26), and (73.27), as well as the counterterm 
contriubtion, we find that the gluon self-energy is transverse, 

n ^ ab (k) = H(k 2 )(k 2 g^ - k^k u )5 ab , (73.28) 

and that 

n(fc 2 )div = - (Za-1) + [§T( A) - |n P T(R)] i + 0(<? 4 ) . (73.29) 

Thus we find 

Z 3 = l+ [|T(A) - |n F T(R)] + 0(g A ) (73.30) 

in Feynman gauge and the MS renormalization scheme. 

Let us collect our results: 

Zx = l- [C(R) + T( A)] £; - £ + 0(g 4 ) , (73.31) 

Z 2 = l~ C(R) £2 \ + 0(g 4 ) , (73.32) 

Z 3 = l+ [§T(A) - |n F T(R)] ± + 0(g A ) , (73.33) 

in Feynman gauge and the MS renormalization scheme. We define 

q 2 

a=%- . (73.34) 

47T 

Then we have 

Z 2 

«° = • (73.35) 

Let us write 

in (zyzyzf) = ± . ( 73 . 36 ) 

n = 1 £ 

Then we have 

In a 0 = X] + In o + e In jl . (73.37) 

n = 1 £ 

From eqs. (73.31-73.33), we get 

Gi(o) = - [^T(A) - |n F T(R)j + 0( a 2 ) . (73.38) 

J Z7T 

Then, the general analysis of section 28 yields 


/3(a) = a 2 G' 1 (a ) , 


(73.39) 
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where the prime denotes differentiation with respect to a. Thus we find 

,,2 


r\ 

(3(a) = - fT( A) - §n F T(R) — + 0(a a ) 


(73.40) 


in nonabelian gauge theory with n F Dirac fermions in the representation R 
of the gauge group. 

We can, if we like, restate eq. (73.40) in terms of g as 


/%) 


fT(A) - fn F T(R) 


167T 2 


+ 0(g 5 ). 


(73.41) 


To go from eq. (73.40) to eq. (73.41), we use a = g 2 /3 tt and a = gg/2n, 
where the dot denotes d/dln ji. 

In quantum chromodynamics, the gauge group is SU(3), and the quarks 
are in the fundamental representation. Thus T( A) = 3 and T(R) = and 
the factor in square brackets in eq. (73.41) is 11 — |n F . So for n F < 16, the 
beta function is negative: the gauge coupling in quantum chromodynamics 
gets weaker at high energies, and stronger at low energies. 

This has dramatic physical consequences. Perturbation theory cannot 
serve as a reliable guide to the low-energy physics. And indeed, in nature we 
do not see isolated quarks or gluons. (Quarks, in particular, have fractional 
electric charges and would be easy to discover.) The appropriate conclusion 
is that color is confined: all finite-energy states are invariant under a global 
SU(3) transformation. This has not yet been rigorously proven, but it is 
the only hypothesis that is consistent with all of the available theoretical 
and experimental information. 


Problems 

73.1) Compute the beta function for g in Yang-Mills theory with a complex 
scalar field in the representation R of the gauge group. Hint: all the 
real work has been done already in this section, problem 72.1, and 
section 66. 

73.2) Write down the beta function for the gauge coupling in Yang-Mills 
theory with several Dirac fermions in the representations Rj, and 
several complex scalars in the representations R) . 

73.3) Compute the one-loop contributions to the anomalous dimensions of 
m, T, and A 
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74 BRST Symmetry 

Prerequisite: 70, 71 

In this section we will rederive the gauge-fixed path integral for nonabelian 
gauge theory from a different point of view. We will discover that the 
complete gauge-fixed lagrangian, C = C Y m + £ g f + £ g h, still has a residual 
form of the gauge symmetry, known as Becchi-Rouet-Stora- Tyutin symme- 
try i, or BRST symmetry for short. BRST symmetry can be used to derive 
the Slavnov-Taylor identities that, among other useful things, show that 
the coupling constant is renormalized by the same factor at each of its ap- 
pearances in C. Also, we can use BRST symmetry to show that gluons 
whose polarizations are not both spacelike and transverse (perpendicular 
to the four-momentum) decouple from physical scattering amplitudes (as 
do particles that are created by the ghost field). 

Consider a nonabelian gauge theory with a gauge field A“(x), and a 
scalar or spinor field fa(x) in the representation R. Then, under an in- 
finitesimal gauge transformation parameterized by 8 a (x), we have 


6A*(x) = - D ab 9 b {x ) , (74.1) 

&fa(x) = - ig6 a (x)(T^) ij (l) j (x ) . (74.2) 

We now introduce a scalar Grassmann field c a {x) in the adjoint represen- 
tation; this field will turn out to be the ghost field that we introduced in 
section 71. We define an infinitesimal BRST transformation via 

6 b A“(x) = Dfc\x) (74.3) 

= d,c a (x) - gf abc A^x)c\x) , (74.4) 

fafa(x) = igc a (x)(T^)ij(j) j (x) . (74.5) 


This is simply an infinitesimal gauge transformation, with the ghost field 
c a (x ) in place of the infinitesimal parameter —9 a {x). Therefore, any combi- 
nation of fields that is gauge invariant is also BRST invariant. In particular, 
the Yang-Mills lagrangian C Y m (including the appropriate lagrangian for 
the scalar or spinor field fa) is BRST invariant, 

^'b^ym = 0 . (74.6) 

We now place a further restriction on the BRST transformation: we 
require a BRST variation of a BRST variation to be zero. This requirement 
will determine the BRST transformation of the ghost field. Consider 


fa(fafa) — fa{^BC a ){T^)ij(f>j — igc a (T^)ijd B (f>j . 


(74.7) 
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There is a minus sign in front of the second term because <5 B acts as an 
anticommuting object, and it generates a minus sign when it passes through 
another anticommuting object, in this case c a . Using eq. (74.5), we have 

S B (S B <f>i) = ig(S B c a ) - g 2 c a c b (T£T b ) ik <t> k . (74.8) 

We now use c b c a = —c a c b in the second term to replace T“T^ with its anti- 
symmetric part, 7j[T£,T^\ = \f abc T^. Then, after relabeling some dummy 
indices, we have 

<y B (<y B &) = ig(s B c c + yf abc c a c b )(T£) ij <j> j . (74.9) 

The right-hand side of eq. (74.9) will vanish for all 4>j(x) if and only if 

S B c c (x) = -\gf abc c a {x)c b {x) . (74.10) 

We therefore adopt eq. (74.10) as the BRST variation of the ghost field. 

Let us now check to see that the BRST variation of the BRST variation 
of the gauge field also vanishes. From eq. (74.4) we have 

6 b (5 b A“) = (6*% - gf abc A^){5 B c b ) - gf abc (5 B A^c b 

= Df{5 B c b )-gf abc {Dfc d )c b 

= D ab {5 B c b ) - 5 /“ 6 c (^c c )c 6 + g 2 f abc f cde A e ^c d c b . (74.11) 

We now use the antisymmetry of f abc in the second term to replace (0 M c c )c 6 
with its antisymmetric part, 

(V [ V ] = |( W - KvV 

= ±(d M cV + \c c {d^c b ) 

= \Mc c c b ) . (74.12) 

Similarly, we use the antisymmetry of c d c b in the third term to replace 
fnbcj'cde w j^] 1 its antisymmetric part, 

1 ^j?abcj?cde j?adcj?cbe^ — r p^ ) ^ ac ^ j^d^ce ^jnd^ac ^rj-ib^ce j 

= ~\if bdh (T b ) ae 

= -lfbdhfhae > ( 74 . 1 . 3 ) 

which is just the Jacobi identity. Now we have 

5 b (5 b A;) = D ab (5 B c b ) - \gf abc {d^c c c b ) - \g 2 f bdh f hae A^c d c b 
= Df (S B c h ) - (rt M - gf ahe A^gf bch c c c b 
= D; h (6 B c h + \gf bch c b c c ) . 


(74.14) 
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We see that this vanishes if the BRST variation of the ghost field is given 
by eq. (74.10). 

Now we introduce the antighost field c“(x). We take its BRST trans- 
formation to be 

5 B ^(x) = B a (x ) , (74.15) 

where B a (x ) is a commuting (as opposed to Grassmann) scalar field, the 
Lautrup-Nakanishi auxiliary field. Because B a (x ) is itself a BRST variation, 
we have 

5 B B a (x) = 0. (74.16) 

Note that eq. (74.15) is in apparent contradiction with eq. (74.10). How- 
ever, there is actually no need to identify c a {x) as the hermitian conjugate 
of c a (x). The role of these fields (in producing the functional determinant 
that must accompany the gauge-fixing delta functional) is fulfilled as long 
as c a {x) and c“( x ) are treated as independent when we integrate them; 
whether or not they are hermitian conjugates of each other is irrelevant. 
We identified them as hermitian conjugates in section 71 only for the sake 
of familiarity in deriving the associated Feynman rules. Now, however, we 
must abandon this notion. In fact, it will be most convenient to treat c a {x) 
and c a {x) as two real Grassmann fields. 

Now that we have introduced a collection of new fields — c a (x), c%t), 
and B a (x) — what are we to do with them? 

Consider adding to C Y m a new term that is the BRST variation of some 
object O , 

£ = £ ym + 5 b O. (74.17) 

Clearly C is BRST invariant, because L YM is, and because <5 B (<5 B 0) = 0. 
We will see that adding 5 b O corresponds to fixing a gauge; which gauge we 
get depends on O. 

We will choose 


0(x) 


<?{x)\\ZB a (x) - G a (x) 


(74.18) 


where G a {x) is a gauge-fixing function, and £ is a parameter. If we further 
choose 


G a (x ) = d»A“(x) , 


(74.19) 


then we end up with gauge. 

Let us see how this works. We have 


5 b O = (5 B ?)\\tB a - d^A 


— (f 




(74.20) 


There is a minus sign in front of the second set of terms because <J B acts 
as an anticommuting object, and so it generates a minus sign when it 
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passes through another anticommuting object, in this case c°. Now using 
eqs. (74.3), (74.15), and (74.16), we get 

b B 0 = \iB a B a - + ^d^ L D ab c b . (74.21) 

We see that the last term is the ghost lagrangian £ g h that we found in 
section 71. If we like, we can integrate the ordinary derivative by parts, so 
that it acts on the antighost field, 

6 b O -> \iB a B a - B a d^A“ - d^?D ab c b . (74.22) 

Examining the first two terms in eq. (74.22), we see that no derivatives 
act on the auxiliary field B a (x). Furthermore, it appears only quadratically 
and linearly in 5 b O. We can, therefore, perform the path integral over it; 
the result is equivalent to solving the classical equation of motion 

= £B a (x) - 8^(x) = 0 , (74.23) 

and substituting the result back into S b O. This yields 

5 b O -> -^d'A^Al - d^c a D ab c b . (74.24) 

We see that the first term is the gauge-fixing lagrangian £ g f that we found 
in section 71. 

We now take note of all the symmetries of the action S = f d 4 x£, where 
C = C Y M + d B 0. With our choice of O, they are: (1) Lorentz invariance; (2) 
the discrete symmetries of parity, time reversal, and charge conjugation; (3) 
global gauge invariance (that is, invariance under a gauge transformation 
with a spacetime-independent parameter 6 a ); (4) BRST invariance; (5) 
ghost number conservation; and (6) antighost translation invariance. 

Global gauge invariance simply requires every term in C to have all 
the group indices contracted in a group-invariant manner. Ghost number 
conservation corresponds to assigning ghost number +1 to c a , — 1 to c a , and 
zero to all other fields, and requiring every term in C to have ghost number 
zero. Antighost translation invariance corresponds to c a {x ) — ► c a {x) + x, 
where x is a Grassmann constant. This leaves C invariant because, in the 
form of eq. (74.22), C contains only a derivative of c b (x). 

We now claim that C already includes all terms consistent with these 
symmetries that have coefficients with positive or zero mass dimension. 
This means that we will not encounter any divergences in perturbation 
theory that cannot be absorbed by including a Z factor for each term 
in C. Furthermore, loop corrections should respect the symmetries, and 
BRST symmetry requires that g renormalize in the same way at each of 
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its appearances. (Filling in the mathematical details of these claims is a 
lengthy project that we will not undertake.) 

We can regard a BRST transformation as infinitesimal, and hence con- 
struct the associated Noether current via the standard formula 

J ' SW = ? 80 ,* J ( I )) W ' W ’ (7425) 

where 4>/(x) stands for all the fields, including the matter (scalar and/or 
spinor), gauge, ghost, antighost, and auxiliary fields. We can then define 
the BRST charge 

Qb = I d 3 xj° B (x) . (74.26) 

If we think of c a (x) and c“(x) as independent hermitian fields, then Q B is 
hermitian. The BRST charge generates a BRST transformation, 


i[Q B ,A“(x)] =Dfc\ x), (74.27) 

i{Q B ,c a (x)} = -± gf abc c b (x)c c (x) , (74.28) 

»{Q B ,e°(x)} = B a {x) , (74.29) 

i[Q B ,B a (x)} = 0, (74.30) 

i[Q B ,Mx)]± = igc a (x)(T£) i j</) j (x ) . (74.31) 


where {A, B} = AB + BA is the anticommutator, and [ , ]± is the commu- 
tator if fa is a scalar field, and the anticommutator if 0, is a spinor field. 
Also, since the BRST transformation of a BRST transformation is zero, Q B 
must be nilpotent , 

Ql = 0 . (74.32) 

Eq. (74.32) has far-reaching consequences. In order for it to be satisfied, 
many states must be annihilated by Q B ; such states are said to be in the 
kernel of Q B . A state \ijj) which is annihilated by Q B may take the form 
of Q b acting on some other state; such states are said to be in the image 
of Q b . There may be some states in the kernel of Q B that are not in the 
image; such states are said to be in the cohomology of Q B . Two states in 
the cohomology of Q B are identified if their difference is in the image; that 
is, if Qb|V’) = 0 but \ij>) / Q b \x) f° r an Y state |%), and if |V’ / ) = IV’) + Qb|C) 
for some state |£), then we identify \ip) and |^ 7 ) as a single element of the 
cohomology of Q B . 

Note any state in the image of Q B has zero norm, since if |V>) = Q B |x)> 
then {ip\ij}) = (ip\Q B \x) = 0- (Here we have used the hermiticity of Q B to 
conclude that Q B \i/r) = 0 implies (ip\Q B = 0.) 
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Now consider starting at some initial time with a normalized state | ip) 
in the cohomology: (ip\ip) = 1, Q b|V ; ) = 0, | ip) / Q B |x)- (This last equation 
is actually redundant, because if \ip) = Q B |x) for some state |x), then l^) 
has zero norm.) Since C is BRST invariant, the hamiltonian that we derive 
from it must commute with the BRST charge: [ H , Q B ] = 0. Thus, an 
initial state \ip) that is annihilated by Q B must still be annihilated by it at 
later times, since Q B e~ lHt \ip) = e~ iHt Q B \ip) = 0. Also, since unitary time 
evolution does not change the norm of a state, the time-evolved state must 
still be in the cohomology. 

We now claim that the physical states of the theory correspond to the 
cohomology of Q B . We have already shown that if we start with a state 
in the cohomology, it remains in the cohomology under time evolution. 
Consider, then, an initial state of widely separated wave packets of incoming 
particles. According to our discussion in section 5, we can treat these states 
as being created by the appropriate Fourier modes of the fields, and ignore 
interactions. We will suppress the group index (because it plays no essential 
role when interactions can be neglected) and write the mode expansions 


A^{x) 

= £ /*[er(k)o A (k)e ifa +<(k)4(k)e- ifa ], 

(74.33) 


A=>,<, 


c(x) 

= Jdk [c(k)e ikx + c\k)e~ ikx ] , 

(74.34) 

c(x) 

= Jdk [b(k)e ikx + b\k)e~ ikx ] , 

(74.35) 

<f(x) 

= J dk [o 0 ( k)e ikx + a\(k)e ~ ikx ] , 

(74.36) 


Here, for maximum simplicity, we have taken (p(x) to be a real scalar field. 
(This is possible if R is a real representation.) In eq. (74.33), we have 
included four polarization vectors that span four-dimensional spacetime. 
For = (cu, k) = a;(l, 0, 0, 1), we choose these four polarization vectors to 
be 

4(k) = 75(1,0,0.1) , 

£ <( k ) = 7f (!’ O’ o, — 1) , 
e+(k) = 75 ( 0 , 1, -*, 0 ) , 

£-(k) = 75 ( 0 , 1 ,+*,°) . (74.37) 

The first two of these, > and <, are lightlike vectors; £>(k) is parallel to 
k and £<(k) is spatially opposite. The latter two, + and — , are spacelike 
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and transverse: they correspond to physical photon polarizations of definite 
helicity. 

We set g = 0, plug eqs. (74.33-74.36) into eqs. (74.27-74.31), and use 
eq. (74.23) to eliminate the auxiliary field. Matching coefficients of e ~ lkx , 
we find 


[Q B ,<4(k)] = \f2uj 5\> c t (k) , (74.38) 

{Qbj c^(k)} = 0 , (74.39) 

{Q B ,6 t ( k)} = f^a^k) , (74.40) 

[Q B> 4(k)]=0. (74.41) 


Consider a normalized state \iJj) in the cohomology: |V’) = 1; Q b|V’) = 0. 

Eq. (74.38) tells us that if we add a photon with the unphysical polarization 
> by acting on \ijj) with a>(k), then this state is not annihilated by Q B \ 
hence the state at > (k)|'0) is not in the cohomology. Eq. (74.40) tells us that 
the state a<(k)|'!/>) is proportional to Q B b^( k)|?/>}; hence the state a<(k)|V>) 
is also not in the cohomology. On the other hand, the states a+( k)|^) and 
aL(k)|^) are annihilated by Q B , but they cannot be written as Q B acting 
on some other state; hence these states are in the cohomology. Also, by 
similar reasoning, the state with one extra <f> particle, a^( k)|^), is in the 
cohomology. 

Eq. (74.38) tells us that if we add a ghost particle by acting on \ip) 
with c^(k), then this state is proportional to Q B a>(k)|^); hence the state 
ci(k)|^) is not in the cohomology. Eq. (74.40) tells us that if we add an 
antighost particle by acting on | i/j) with &t(k), then this state is not anni- 
hilated by Q b , hence the state ih(k)|^>) is also not in the cohomology. 

We conclude that the only particle creation operators that do not take 
a state out of the cohomology are a^(k), a+(k), and al(k). Of course, it 
is precisely these operators that create the expected physical particles. 

Finally, we note that the vacuum |0) must be in the cohomology, because 
it is the unique state with zero energy and positive norm. 

Thus we can conclude that we can build an initial state of widely sep- 
arated particles that is in the cohomology only if we do not include any 
ghost or antighost particles, or photons with polarizations other than + 
and — . Since a state in the cohomology must evolve to another state in the 
cohomology, no ghosts, antighosts, or unphysically polarized photons can 
be produced in the scattering process. 


Reference Notes 


A detailed treatment of BRST symmetry can be found in Weinberg II. 
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Problems 

74.1) The creation operator for a photon of positive helicity can be written 
as 

4(k) = -i 4*(k) J d 3 x e +ikx d 0 A„[x) . (74.42) 

Consider the state a+(k)|^):., where | ip) is in the BRST cohomology. 
Define a gauge-transformed polarization vector 

e^(k)=4(k ) + ck ft , (74.43) 

where c is a constant, and a corresponding creation operator a+(k). 
Show that 

ai(k)IV’) = a+OOIV’) + Qb|x) . (74.44) 

which implies that a+(k)|V>} and a+ (k)|^) represent the same element 
of the cohomology, and hence are physically equivalent. Find the state 

lx)- 



75: Chiral Gauge Theories and Anomalies 


443 


75 Chiral Gauge Theories and Anomalies 

Prerequisite: 70, 72 


So far, we have only discussed gauge theories with Dirac fermion fields. 
Recall that a Dirac field 4/ can be written in terms of two left-handed Weyl 
fields x and £ as 


-L = 



(75.1) 


If T is in a representation R of the gauge group, then x and £' must be as 
well. Equivalently, x must be in the representation R, and £ must be in the 
complex conjugate representation R. (For an abelian theory, this means 
that if 4/ has charge +Q, then x das charge +Q and £ has charge —Q.) 
Thus a Dirac field in a representation R is equivalent to two left-handed 
Weyl fields, one in R and one in R. 

If the representation R is real, then we can have a Majorana field 


4/ = 



(75.2) 


instead of a Dirac field; the left-handed Weyl field ip mid and its hermitian 
conjugate ip' are both in the representation R. Thus a Majorana field in a 
real representation R is equivalent to a single left-handed Weyl field in R. 

Now suppose that we have a single left-handed Weyl field ip in a complex 
representation R. Such a gauge theory is automatically parity violating 
(because the right-handed hermitian conjugate of the left-handed Weyl field 
is in an inequivalent represetnation of the gauge group), and is said to be 
chiral. The lagrangian is 

£ = - \F a ^F« v , (75.3) 

where D ^ — igA®T£. Since is a hermitian matrix (even when R 

is a complex representation), iip^o^D^ip is hermitian (up to a total diver- 
gence, as usual). We cannot include a mass term for ip, though, because 
ipip transforms as R <8> R, and R <8> R does not contain a singlet if R is 
complex. Thus, ipip is not gauge invariant. But without a mass term, this 
lagrangian would appear to possess all the required properties: Lorentz 
invariance, gauge invariance, and no terms with coefficients with negative 
mass dimension. 

However, it turns out that most chiral gauge theories do not exist as 
quantum field theories; they are anomalous. The problem can ultimately 
be traced back to the functional measure for the fermion field; it turns out 
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that this measure is, in general, not gauge invariant. We will explore this 
surprising fact in section 77. 

For now we will content ourselves with analyzing Feynman diagrams. 
We will find an insuperable problem with gauge invariance at the one-loop 
level that afflicts most chiral gauge theories. 

We will work with the simplest possible example: a U(l) theory with a 
single Weyl field V’ with charge +1. The lagrangian is 

£ = - igA^ - \F^ V . (75.4) 


We can use the following trick to write this theory in terms of a Dirac field 
'k. We note that 



(75.5) 


where P h = 5(1—75) is the left-handed projection matrix, does not involve 
the right-handed components of T. Then we can write eq. (75.4) as 


C = - igA^Pi* - \F< w F ilv , (75.6) 


and treat f as a Dirac field when we derive the Feynman rules. 

To better understand the physical consequences of eq. (75.5), consider 
the case of a free field. The mode expansion is 


PA> 


» = Y 

s= d= ' 


dp \b s (p)P Ij u s (p)e ipx + dl(p)P L v s (p)e- ipx 


(75.7) 


For a massless field, we learned in section 38 that P L u + (p) = 0 and 
P L u_(p) = 0. Thus we can write eq. (75.7) as 


Ppl>(x) 


dp 


b_(p)u_{p)e ipx + d\(p)v + (p)e~ ipx 


(75.8) 


Eq. (75.8) shows us that there are only two kinds of particles associated with 
this field (as opposed to four with a Dirac field): h/(p) creates a particle 
with charge +1 and helicity —1/2, and d\ (p) creates a particle with charge 
— 1 and helicity +1/2. In this theory, charge and spin are correlated. 

We can easily read the Feynman rules off of eq. (75.6). In particular, 
the fermion propagator in momentum space is —P^rf/p 2 , and the fermion- 
fermion-photon vertex is ig^^P^. 

When we go to evaluate loop diagrams, we need a method of regulat- 
ing the divergent integrals. However, our usual choice, dimensional reg- 
ularization, is problematic, due to the close connection between 75 and 
four-dimensional spacetime. In particular, in four dimensions we have 


T r [757 M 7 i/ 7 P 7 <T ] = —4 ie^ up<T 


(75.9) 
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Figure 75.1: The one- loop and counterterm corrections to the photon prop- 
agator. 


where e 0123 = +1. It is not obvious what should be done with this formula 
in d dimensions. One possibility is to take d > 4 and define 75 = *7°7 1 7 2 7 3 . 
Then eq. (75.9) holds, but with each of the four vector indices restricted to 
span 0, 1, 2, 3. We also have {7^, 75} = 0 for /a = 0, 1, 2, 3, but [7^, 75] = 0 
for /a > 3. This approach is workable, but cumbersome in practice. 

It is therefore tempting to abandon dimensional regularization in favor 
of, say, Pauli-Villars regularization, which involves the replacement 


P £ . P (zl_ zt±A\ 
V h \p 2 p 2 + A 2 J ' 


(75.10) 


Pauli-Villars regularization is equivalent to adding an extra fermion field 
with mass A, and a propagator with the wrong sign (corresponding to 
changing the signs of the kinetic and mass terms in the lagrangian). But, 
a Dirac field with a chiral coupling to the gauge field cannot have a mass, 
since the mass term would not be gauge invariant. So, in a chiral gauge 
theory, Pauli-Villars regularization violates gauge invariance, and hence is 
unacceptable. 

Given the difficulty with regulating chiral gauge theories (which is a 
hint that they may not make sense), we will sidestep the issue for now, and 
see what we can deduce about loop diagrams without a regulator in place. 

Consider the correction to the photon propagator, shown in fig. (75.1). 
We have 

= (-!)(«)* (I) 

- i(Z 3 -l)(k 2 g ^ - kW) + 0(g 4 ) , (75.11) 

where the numerator is 


= Tr[P L (f.+l,t)^P L PJYPr} ■ (75.12) 

We have P 2 = P L and P ^7^ = 7 fl P R (and hence P L 7^7^ = 7^7 u P h ), and so 
all the P L ’ s in eq. (75.12) can be collapsed into just one; this is generically 
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true along any fermion line. Thus we have 

N ^ = Tr[(f+^fYPr] • (75.13) 

The term in eq. (75.13) with P L — ► ^ simply yields half the result that we 
get in spinor electrodynamics with a Dirac field. 

The term in eq. (75.13) with P L — > — 575 , on the other hand, yields a 
vanishing contribution to I T LV {k). To see this, first note that 

A7- -iTV[(/+^) 7 7 7 - 75 ] 

= 2 i£ a ^ v {i+k) a tp 

= 2 ie a ^ v k a ip . (75.14) 

Thus we have 

n- (t) = i D-ttW - jPh 

- (Z 3 -l)(k 2 g^ - kPk?) + 0{g A ) . (75.15) 


The integral is logarithmically divergent. But, it carries a single vector 
index /3 , and the only vector it depends on is k. Therefore, any Lorentz- 
invariant regularization must yield a result that is proportional to kp. This 
then vanishes when contracted with e a/ ' k a . We therefore conclude that, 
at the one-loop level, the contribution to I l tiU (k) of a single charged Weyl 
field is half that of a Dirac field. This is physically reasonable, since a Dirac 
field is equivalent to two charged Weyl fields. 

Nothing interesting happens in the one-loop corrections to the fermion 
propagator, or the fermion-fermion-photon vertex. There is simply an 
extra factor of P h along the fermion line, which can be moved to the far 
right. Except for this factor, the results exactly duplicate those of spinor 
electrodynamics . 

All of this implies that a single Weyl field makes half the contribution of 
a Dirac field to the leading term in the beta function for the gauge coupling. 

Next we turn to diagrams with three external photons, and no external 
fermions, shown in fig. (75.2). In spinor electrodynamics, the fact that the 
vector potential is odd under charge conjugation implies that the sum of 
these diagrams must vanish; see problem 58.2. For the present case of a 
single Weyl field, there is no charge-conjugation symmetry, and so we must 
evaluate these diagrams. 

The second diagram in fig. (75.2) is the same as the first, with p q 
and g v. Thus we have 


iV^ up (p, q, r ) 


( 1) Off ) 3 ( t ) 3 


dH N^p 

(27t) 4 (l—p) 2 £ 2 (l+q) 2 
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Figure 75.2: One-loop contributions to the three-photon vertex. 

+ (P,T q,v) + 0(g 5 ) , (75.16) 

where 

N^p = Tr[(-f+tf> 1 P(-{)Y(-Mh P Pu] • (75.17) 

The term in eq. (75.17) with P L — ► | simply yields half the result that we 

get in spinor electrodynamics with a Dirac field, which gives a vanishing 
contribution to V Att/p (p, q, r). Hence, we can make the replacement P L — » 
— 7^75 in eq. (75.17). Then, after cancelling some minus signs, we have 

iTr[(/-^) 7 7 7 -(/+^) 7 ^ 5 ] . (75.18) 

We would now like to verify that \ ,wp (p, q, r) is gauge invariant. We 
should have 



p^VP up {p,q,r) = 0 , 

(75.19) 


9 „V^(p,g,r) =0, 

(75.20) 


r p V flup (p,q, r ) = 0 . 

(75.21) 

Let us first check the last of these. From eq. (75.16) we find 



^ TUVPt , . 3 /' dH rpN^P 

rp q P ] - 19 J (27 r ) 4 (£-p)H 2 {£+q) 2 



+ (p,At <-»• q,v) + 0(g 5 ) , 

(75.22) 

where 

r pN ^ p = IT 'riV-fl'ff'fU+firp'f'rs] • 

(75.23) 

It will be 
eq. (75.23) 

convenient to use the cyclic property of the trace to 
as 

rewrite 


r p N pvp = |Tr [^ 7 " (f.+$)r p7 p (/— ^) 7 M 75 ] • 

(75.24) 
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To simplify eq. (75.24), we write = f = — (4+rf) = —(f+tf) + if-rf)- 
Then 

(M)r P Y(M) = (f+4)[Y?+4) + (t-MM) 

= (£+q)\M) - {£-p?{t+i) • (75.25) 

Now we have 


r p N^ = \{£+q) 2 Tr [?Y(M)YY ~ W~V? Tr^+^Ts] 

= -2 is au0fJ {£+q) 2 £ a (£-p)p - (£-p) 2 £ a (£+q)p 

= +2 i£ av ^ [{£+q) 2 £aP(3 + {Z-p)Y Q q 0 \ (75.26) 

Putting eq. (75.26) into eq. (75.22), we get 


rp V^ p {p,q,r) 


_0„3 f \ YP(3 £qQld 

y J (27 r) 4 U 2 {£-p) 2 ^ £ 2 {£+q) 2 _ 

+ (p,T ^ q,v) + °(g 5 ) , (75.27) 


Consider the first term in the integrand. Because the only four-vector 
that it depends on is p, any Lorentz-invariant regularization of its integral 
must yield a result proportional to p a P/3 ■ Similarly, any Lorentz-invariant 
regularization of the integral of the second term must yield a result pro- 
portional to q a qp- Both p a pp and q a qp vanish when contracted with . 
Therefore, we have shown that 


r p V'"' p (p,q,r) = 0, (75.28) 

as required by gauge invariance. 

It might seem that now we are done: we can invoke symmetry among 
the external lines to conclude that we must also have p p \ pvp (p,q,r) = 0 
and q v \ pvp (p, q,r) = 0. However, eq. (75.16) is not manifestly symmetric 
on the exchanges (p, p r, p) and (q. v <-> r,p). So it still behooves us to 
compute either p lt lV pMp (p. q , r) or q u Y pi/p (p, q, r ). 

From eq. (75.16) we find 

„ uvo . , , f d 4 £ PuN pup 

P '‘V = ! » Jw v-pme+tf 

+ (?,/*<-»• q,v) + 0(g 5 ) , (75.29) 


where 


p,N^ p = l 2 Tr[(M)p,Yh U (Mh P Y • 


(75.30) 
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To simplify eq. (75.30), we write p^ p = ]j = — (f.—jf) + Then 

= ■ (75.31) 


Now we have 

= W-p) 2 Tr[W(l+4)Y^\ - 

\{£~p) 2 £ a q/3 ~ t 2 (t~p)a(t+q)0 
i-p) 2 l a qp - i 2 (l-p) a (l-p + p+q)p 
\{£-p) 2 £ a qp - £ 2 (£-p) a (p+q) 0 ] • (75.32) 


= -2 is au(3p 
= -2 ie au(3p 
= -2 ie avPp 


Putting eq. (75.32) into eq. (75.29), we get 

dH 


p^ p (p,q,r) = 2 g 3 e a ^ j 


(27 r) 


ZaQ0 ( Z~P)a(p+q)0 


l 2 (l+q ) 2 (i-p) 2 (i+q ) 2 \ 


+ q,v) + 0 (g b ) , 


(75.33) 


The first term on the right-hand side of eq. (75.33) must vanish, because 
any Lorentz-invariant regularization of the integral must yield a result pro- 
portional to q a q 0 i and this vanishes when contracted with £ ap d p . 

As for the second term, we can shift the loop momentum from t to £+p, 
which results in 

(l-p) a (p+g)/? t a (p+q )0 ^ 

(£-p) 2 (£+q) 2 £ 2 (£+p+q) 2 ' 1 ’ 

We can now use Lorentz invariance to argue that the integral of the right- 
hand side of eq. (75.34) must yield something proportional to (p+q) a (jP+q) 0\ 
this vanishes when contracted with with £ av P p . Thus, we have shown that 
P, i y il ' p {P- q- r ) = 0, as required by gauge invariance, provided that the 
shift of the loop momentum did not change the value of the integral. This 
would of course be true if the integral was convergent. Instead, however, 
the integral is linearly divergent, and so we must be more careful. 

Consider a one-dimensional example of a linearly divergent integral: let 


/ +oo 

' 

-OO 


dx f(x+a ) , 


(75.35) 


where /(Too) = c-t, with c+ and c_ two finite constants. If the integral con- 
verged, then 1(a) would be independent of a. In the present case, however, 
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we can Taylor expand f(x+a) in powers of a, and note that /(Too) = c± 
implies that every derivative of f(x) vanishes at x = Too. Thus we have 

/ +°o r 

dx f{x) + af{ x) + \a 2 f"(x) + ... 

-OO J 

= 7(0) + a(c+ - c_) . (75.36) 

We see that 1(a) is not independent of a. Furthermore, even if we cannot 
assign a definite value to 1(0) (because the integral is divergent), we can 
assign a definite value to the difference 

1(a) -1(0) = a(c+ — c_) . (75.37) 

Now let us return to eqs. (75.33) and (75.34). Define 


'•<*> = yyyy 

Using Lorentz invariance, we can argue that 


(75.38) 


rj fa(t) = A (pTq)a , 


(75.39) 


where A is a scalar that will depend on the regularization scheme. Now 
consider 

fa(t-p) = fa(Z) - V^-^pfa(f) + • • • • (75.40) 

The integral of the first term on the right-hand side of eq. (75.40) is given 
by eq. (75.39). The integrals of the remaining terms can be converted to 
surface integrals at infinity. Only the second term in eq. (75.40) falls off 
slowly enough to contribute. To determine the value of its integral, we 
make a Wick rotation to euclidean space, which yields a factor of i as 
usual; then we have 


/ yy w fM = ' !M ■ (75 - 41> 

where dSp = £ 2 £pdfl is a surface-area element, and dO is the differential 
solid angle in four dimensions. We thus find 




dn tpia 

(27 r) 4 ( £+p+q ) 2 


Qa 1 


\4 4 ga P 


(75.42) 
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where we used U 4 = 2tt 2 . Combining eqs. (75.38-75.42), we find 


f (t~P)a _ A( j !_ 

J (2tt) 4 (£-p) 2 (£+q) 2 A{P+<1 )a 32 tt' 2 Pa ' 


(75.43) 


Using this in eq. (75.33), we find 

. 3 

Pn~V p ‘ vp {p, q, r ) = ^2 £aUl3p Pa{p+Q)p + {P,P^ q, v) + 0{g 5 ) 

= ^^Paqp + Oig 5 ). (75.44) 

An exactly analogous calculation results in 

g„V^(p, q,r) = ^ £ op9p q aP p + 0(g 5 ) . (75.45) 

Eqs. (75.44) and (75.45) show that the three-photon vertex is not gauge 
invariant. Since r p V pup (p, q,r) = 0, eqs. (75.44) and (75.45) also show that 
the three-photon vertex does not exhibit the expected symmetry among 
the external lines. 

This is a puzzle, because the only asymmetric aspects of the diagrams in 
fig. (75.2) are the momentum labels on the internal lines. The resolution of 
the puzzle lies in the fact that the integral in eq. (75.16) is linearly divergent, 
and so shifting the loop momentum changes its value. To account for this, 
let us write 'V pvp {p, q , r) with £ replaced with £ + a, where a is an arbitrary 
linear combination of p and q. We define 


V^(p,g,r;a) = \ig 3 


d A £ Tr[(/+a— ^) 7 ^+a) 7 ^+a+^ 75 ] 
(27t) 4 (£+a—p) 2 (£+a) 2 (£+a+q) 2 


+ (p,p^ q,u) + Q(g 5 ) . 


(75.46) 


Our previous expression, eq. (75.16), corresponds to a = 0. The integral 
in eq. (75.46) is linearly divergent, and so we can express the difference 
between V pup (p, q,r\ a) and 'V pl ' p (p, q, r; 0) as a surface integral. Let us 
write 


\ pup (p, q, r; a) = ±igpl a fa(a) Tr[ 'f'f'f'f'f'fjs] 

+ (P,P q, v) + Q{g 5 ) , (75.47) 


where 


I a ^{a) = J 


dH (£+a—p) a (£+a)^(£+a—q) 1 
(27t) 4 (£+a— p) 2 (£+a) 2 (£+a+q) 2 


(75.48) 
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Then we have 


lafl'y (®) laP'y ( 0 ) 


i a 


xa 


r d 4 £ 

d 

' {t-p) a tp{£-qW 


/ (27t) 4 

d£ 5 

. {£-p) 2 £ 2 (£+q) 2 . 


/• 

1 1 TT1 I 

dU 

h(t-p)a£/3(£-q)'r 

11111 1 

l — >00 J 

(27t) 4 (£— p) 2 {£+q) 2 



U 4 1 / 

(^7r)4 24 dSpQ'ya T SS’ySaf} 


— 199^2 (1/39'ia T ®7 dap'j ■ (75.49) 

Using this in eq. (75.47), we get contractions of the form g a /3'Y a 'y p "~f 13 = 27U 
The three terms in eq. (75.49) all end up contributing equally, and after 
using eq. (75.9) to compute the trace, we find 


V^(p, g ,r;o)-V^(p,g,r; 0) = 

+ (p,A* <-»• g,*') + 0(g 5 ) . (75.50) 


Since the Levi-Civita symbol is antisymmetric on p, 1 /, only the part of 
a that is antisymmetric on p <-> q contributes to \ lwp (p, q, r; a). Therefore 
we will set a = c(p — q ), where c is a numerical constant. Then we have 

V^(p, q, r; a) - V^(p, g,r;0) = -^ ce^(p-q)p + 0(g 5 ) . (75.51) 

Using this, along with eqs. (75.28), (75.44), and (75.45), and making some 
simplifying rearrangements of the indices and momenta on the right-hand 
sides (using p+q+r = 0), we find 


P^ up {p, q, r; a) 

= -|J(1 -c)s^q a r p + 0(g 5 ), 

(75.52) 

q u V pup (p,q,r-a) 

= -c)e pp ^r a p 0 + O(g 5 ), 

(75.53) 

r p V pup (p,q,r-a ) 

= -^(2 c)e p ^p a qg + 0(g 5 ). 

(75.54) 


We see that choosing c = 1 removes the anomalous right-hand side from 
eqs. (75.52) and (75.53), but it then necessarily appears in eq. (75.54). Chos- 
ing c = ^ restores symmetry among the external lines, but now all three 
right-hand sides are anomalous. (This is what results from dimensional reg- 
ularization of this theory with 75 = *7°7 1 7 2 7 3 .) We have therefore failed to 
construct a gauge-invariant U(l) theory with a single charged Weyl field. 
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Consider now a U(l) gauge theory with several left-handed Weyl fields 
'ipi, with charges Qi, so that the covariant derivative of ^ is ( d fl —igQiA fl )'ipi . 
Then each of these fields circulates in the loop in fig. (75.2), and each vertex 
has an extra factor of Qi. The right-hand sides of eqs. (75.52-75.54) are 
now multiplied by YliQi ■ And if J2iQi happens to be zero, then gauge 
invariance is restored! The simplest possibility is to have the ■)/’ s come in 
pairs with equal and opposite charges. (In this case, they can be assembled 
into Dirac fields.) But there are other possibilities as well: for example, 
one field with charge +2 and eight with charge —1. Such a gauge theory 
is still chiral, but it is anomaly free. (It could be that further obstacles to 
gauge invariance arise with more external photons and/or more loops, but 
this turns out not to be the case. We will discuss this in section 77.) 

All of this has a straightforward generalization to nonabelian gauge 
theories. Suppose we have a single Weyl field in a (possibly reducible) 
representation R of the gauge group. Then we must attach an extra factor 
of Tr(T£T b Tff to the first diagram in fig. (75.2), and a factor of Tr (T“T^T|) 
to the second; here the group indicies a, b, c go along with the momenta 
p , q , r, respectively. Repeating our analysis shows that the diagrams with 
P L — come with an extra factor of ^Tr([T^, T|]T^) = fT{K)f abc T^\ 
these contribute to the renormalization of the tree-level three-gluon vertex. 
Diagrams with P L — > —^75 come with an extra factor of 

iTr({T“, T b }Tf) = A(R)d abc . (75.55) 

Here d abc is a completely symmetric tensor that is independent of the rep- 
resentation, and A(R) is the anomaly coefficient of R, introduced in sec- 
tion 70. In order for this theory to exist, we must have A(R) = 0. As 
shown in section 70, A(R) = — A(R); thus a theory whose left-handed Weyl 
fields come in R 0 R pairs is automatically anomaly free (as is one whose 
Weyl fields are all in real representations). Otherwise, we must arrange 
the cancellation by hand. For SU(2) and SO(N), all representations have 
A(R) = 0. For SU(IV) with N > 3, the fundamental representation has 
A(N) = 1, and most complex S\J(N) representations R have A(R) / 0. So 
the cancellation is nontrivial. 

We mention in passing two other kinds of anomalies: if we couple our 
theory to gravity, we can draw a triangle diagram with two gravitons and 
one gauge boson. This diagram violates general coordinate invariance (the 
gauge symmetry of gravity). If the gauge boson is from a nonabelian group, 
the diagram is accompanied by a factor of Tr Tf = 0, and so there is no 
anomaly. If the gauge boson is from a U(l) group, the diagram is accom- 
panied by a factor of Qi, and this must vanish to cancel the anomaly. 

There is also a global anomaly that afflicts theories with an odd number 
of Weyl fermions in a pseudoreal representation, such as the fundamental 
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representation of SU(2). The global anomaly cannot be seen in perturba- 
tion theory; we will discuss it briefly in section 77. 

Reference Notes 

Discussions of anomalies emphasizing different aspects can be found in 
Georgi, Peskin & Schroeder, and Weinberg I. 

Problems 

75.1) Consider a theory with a nonabelian gauge symmetry, and also a 
U(l) gauge symmetry. The theory contains left-handed Weyl fields 
in the representations (R*,(5i)> where R* is the representation of the 
nonabelian group, and Qi is the U(l) charge. Find the conditions for 
this theory to be anomaly free. 
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76 Anomalies in Global Symmetries 

Prerequisite: 75 

In this section we will study anomalies in global symmetries that can arise 
in gauge theories that are free of anomalies in the local symmetries (and 
are therefore consistent quantum field theories). A phenomenological ap- 
plication will be discussed in section 90. 

The simplest example is electrodynamics with a massless Dirac field T 
with charge Q = +1. The lagrangian is 

C = i^ipm - \F a ^F^ v , (76.1) 

where Ip = 7 and D M — igA (We call the coupling constant g 

rather than e because we are using this theory as a formal example rather 
than a physical model.) We can write 4/ in terms of two left-handed Weyl 
fields x an d £ via 




(76.2) 

where \ has charge Q = +1 and £ has charge Q = — 1. In terms of 

X and 

£, the lagrangian is 



C = *x t cxA<A - igAJx + i£c rAA + igA .„)£ - \F afiU F“ u . 

(76.3) 

The lagrangian is invariant under a U(l) gauge transformation 


'L(.t) -> 

e -igT(x)^^x) , 

(76.4) 

4/(x) — > 

e +igT(x)^^ ^ 

(76.5) 

A ti (x) — > 

A^(x) — cKT(x) . 

(76.6) 

In terms of the Weyl fields, eqs. (76.4) and (76.5) become 


X(x) - 

► e-^ r W x (x) , 

(76.7) 

Z(x) ~ 


(76.8) 


Because the fermion field is massless, the lagrangian is also invariant under 
a global symmetry in which x and £ transform with the same phase, 
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In terms of 4', this is 


T(x) — ■> e * a75 T(x) , 

(76.11) 

T(x) -> T(x)e" iQ75 . 

(76.12) 


This is called axial U(l) symmetry , because the associated Noether current 

jl{x) = ^(x) 7 ^ 7 5 T(x) (76.13) 

is an axial vector (that is, its spatial part is odd under parity). Noether’s 
theorem leads us to expect that this current is conserved: = 0. How- 

ever, in this section we will show that the axial current actually has an 
anomalous divergence , 

d „ji = ~~£^ e^F^Fpr . (76.14) 

We will see in section 77 that eq. (76.14) is exact; there are no higher-order 
corrections. 

We will demonstrate eq. (76.14) by making use of our results in section 
75. Consider the matrix element (p,q\jA(z)\0) , where (p,q\ is a state of two 
outgoing photons with four-momenta p and q , and polarization vectors 
and £' u , respectively. (We omit the helicity label, which will play no essential 
role.) Using the LSZ formula for photons (see section 67), we have 

MJi(z)\0) = I d 4 xd^e- i ^ + ^(0\Tf(x)f(y)f A (z)\0) , 

(76.15) 

where 

j M (x) = 4'(x)7 /i 'I'(x) (76.16) 

is the Noether current corresponding to the U(l) gauge symmetry. Since 
both j M (x) and j A (x) are Noether currents, we expect the Ward identities 


A-yO\Tr(x)f(y)f^z)\0) = 0 , 

(76.17) 


(76.18) 

-A{ 0 \Tf(x)f(y) 3 Z(z)\ 0 ) = 0, 

(76.19) 


to be satisfied. Note that there are no contact terms in eqs. (76.17-76.19), 
because both j ll (x) and j A (x) are invariant under both U(l) transforma- 
tions. If we use eq. (76.19) in eq. (76.15), we see that we expect 


Jz;{p,q\jA(z)\o) = 0 . 


(76.20) 
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However, our experience in section 75 leads us to proceed more cautiously. 
Let us define C pvp (p. q.r) via 

(2ir) 4 5 4 (p+q+r)C fJ ' up (p, q, r ) 

= J d 4 x d A y d 4 z e~ i ^ px+qy+rz \0\Tj p '(x)j p '(y)j^ L (z)\0) . (76.21) 

Then we can rewrite eq. (76.15) as 


{pA&im = -g 2 e^ u C p " p (p, q, r)e irz 

r=—p—q 

Taking the divergence of the current yields 

(76.22) 

{p,q\d P ji(z)\0) = -ig 2 £ ll £' u r p C p ‘ vp {p,q,r)e irz 

r=—p—q 

The expected Ward identities become 

(76.23) 

PpC pup (p,q,r) = 0 , 

(76.24) 

q u C pvp (p,q.r) = (1 , 

(76.25) 

r p C pup (p,q,r ) = 0 . 

(76.26) 


To check eqs. (76.24-76.26), we compute C pup (p, q,r) with Feynman dia- 
grams. At the one-loop level, the contributing diagrams are exactly those 
we computed in section 75, except that the three vertex factors are now 
7 P , 7 ^, and 7 P 75 , instead of igy p P h , igy v P L , and igy p P h . But, as we saw, 
the three P L ’s can be combined into just one at the last vertex, and then 
this one can be replaced by — ^ 75 . Thus, the vertex function 7V pup (p, q,r) 
of section 75 is related to C pup (p, q,r) by 

■iV pup (p,q,r) = -±(ig) 3 C pup (p,q,r) + 0(g 5 ) . (76.27) 

In section 75, we saw that we could choose a regularization scheme that 
preserved eqs. (76.24) and (76.25), but not also (76.26). For the theory of 
this section, we definitely want to preserve eqs. (76.24) and (76.25), because 
these imply conservation of the current coupled to the gauge field, which is 
necessary for gauge invariance. On the other hand, we are less enamored of 
eq. (76.26), because it implies conservation of the current for a mere global 
symmetry. 

Using eq. (76.27) and our results from section 75, we find that preserving 
eqs. (76.24) and (76.25) results in 

= -^2 ^ vaP v a qp + 0{g 2 ) 


r p C pup (p,q,r) 


(76.28) 
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in place of eq. (76.26). Using this in eq. (76.23), we find 

Mdpji{z) |0> = -|^ a V?/J^e“ <(p+,) * + 0(/) . (76.29) 

Now we come to the point. The right-hand side of eq. (76.29) is exactly 
what we get in free-held theory for the matrix element of the right-hand 
side of eq. (76.14). We conclude that eq. (76.14) is correct, up to possible 
higher-order corrections. 

In the next section, we will see that eq. (76.14) is exact. 

Problems 

76.1) Verify that the right-hand side of eq. (76.29) is exactly what we get 
in free-held theory for the matrix element of the right-hand side of 
eq. (76.14). 
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77 Anomalies and the Path Integral for 

Fermions 

Prerequisite: 76 

In the last section, we saw that in a U(l) gauge theory with a massless 
Dirac field T with charge Q = + 1, the axial vector current 

Ja = *7^75 * , (77.1) 

which should (according to Noether’s theorem) be conserved, actually has 
an anomalous divergence, 

^ pa F^F pa . (77.2) 

In this section, we will derive eq. (77.2) directly from the path integral, 
using the Fujikawa method. We will see that eq. (77.2) is exact; there are 
no higher-order corrections. 

We can also consider a nonabelian gauge theory with a massless Dirac 
field T in a (possibly reducible) representation R of the gauge group. In 
this case, the triangle diagrams that we analyzed in the last section carry 
an extra factor of Tr (T“T*) = T(R)5 a& , and we have 

AA = -r£^T(R)e^d [p A a u] d [p A a a] + 0(g 3 ) , (77.3) 

where d[ p A^ = <9 /t T“ — We expect the right-hand side of eq. (77.3) 

to be gauge invariant (since this theory is free of anomalies in the currents 
coupled to the gauge fields); this suggests that we should have 

AA = - r£^T(R)e^F^F- a , (77.4) 

where Ffj v = d^A^ — d u A a ^ + gf abc A b ^Af, is the nonabelian field strength. 
We will see that eq. (77.4) is correct, and that there are no higher-order 
corrections. 

We can write eq. (77.4) more compactly by using the matrix-valued 
gauge field 

A m = T«A; (77.5) 

and field strength 

F[iv = d p A u - d u A p - ig[A p , A v \ . (77.6) 

Then eq. (77.4) can be written as 


(77.7) 
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We now turn to the derivation of eqs. (77.2) and (77.7). We begin with 
the path integral over the Dirac field, with the gauge field treated as a fixed 
background, to be integrated later. We have 

Z(A) = J e iS{A) , (77.8) 

where 

S(A) = j d 4 x WUpy (77.9) 

is the Dirac action, ilp = is the Dirac wave operator, and 

0^ = 8^- igA , , (77.10) 

is the covariant derivative. Here A ^ is either the U(l) gauge field, or the 
matrix- valued nonabelian gauge field of eq. (77.5), depending on the theory 
under consideration. Our notation allows us to treat both cases simultane- 
ously. 

We can formally evaluate eq. (77.8) as a functional determinant, 

Z(A) = det (ip) . (77.11) 

However, this expression is not useful without some form of regularization. 
We will take up this issue shortly. 

Now consider an axial U(l) transformation of the Dirac field, but with 
a spacetime dependent parameter a(x): 

tf(s) -»• e~ ia(x)75 V(x) , (77.12) 

W(x) -> W(x)e~ ia(xh5 . (77.13) 

We can think of eqs. (77.12) and (77.13) as a change of integration variable 
in eq. (77.8); then Z(A) should be independent of a(x). The corresponding 
change in the action is 

S{A) — > S(A) + J A j)((s)9 M a(i) . (77.14) 

We can integrate by parts to write this as 

S(A) -* S (A) - J d A x a(x)d^ l j'i{x) . (77.15) 

If we assume that the measure VFV'F is invariant under the axial U(l) 
transformation, then we have 


DTDT e iS ^e - * / d ^ x _ 


(77.16) 
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This must be equal to the original expression for Z(A), eq. (77.8). This 
implies that d^j^ (x) = 0 holds inside quantum correlation functions, up to 
contact terms, as discussed in section 22. 

However, the assumption that the measure D'fD't is invariant under 
the axial U(l) transformation must be examined more closely. The change 
of variable in eqs. (77.12) and (77.13) is implemented by the functional 
matrix 

J(x,y) = 5\x-y)e ~ ia (l)75 . (77.17) 

Because the path integral is over fermionic variables (rather than bosonic), 
we get a jacobian factor of (det J) -1 (rather than det J) for each of the 
transformations in eqs. (77.12) and (77.13), so that we have 

VH VF -> (det J)“ 2 VH VF . (77. 18) 

Using log det J = Tr log J, we can write 

(detJ) _2 = exp 2 if d 4 x a(x) Tr 5 4 {x— 2)75 , (77.19) 

where the explicit trace is over spin and group indices. Like eq. (77.11), 
this expression is not useful without some form of regularization. 

We could try to replace the delta-function with a gaussian; this is equiv- 
alent to 

5 4 (x-y) -> e d * /M2 5 4 (x-y) , (77.20) 

where M is a regulator mass that we would take to infinity at the end of the 
calculation. However, the appearance of the ordinary derivative d, rather 
than the covariant derivative D , implies that eq. (77.20) is not properly 
gauge invariant. So, another possibility is 

5 4 {x-y ) -► e D2 / M2 5 4 {x-y) . (77.21) 

However, eq. (77.21) presents us with a more subtle problem. Our regular- 
ization scheme for eq. (77.19) should be compatible with our regularization 
scheme for eq. (77.11). It is not obvious whether or not eq. (77.21) meets 
this criterion, because D 2 has no simple relation to ilp. To resolve this 
issue, we use 

5 4 {x-y) -»• e(^) 2 / M V(x-y) (77.22) 

to regulate the delta function in eq. (77.19). 

To evaluate eq. (77.22), we write the delta function on the right-hand 
side of eq. (77.22) as a Fourier integral, 

S^( x -y)^ j _0_ e {HPA 2 /M\ik(x-y) _ 


(77.23) 
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Then we use f{d)e lkx = e lkx f{d + ik)\ eq. (77.23) becomes 

5 4 {x-y) J e ik{x ~ y) e W-t) 2 /M 2 ; ( 77 . 24 ) 

where a derivative acting on the far right now yields zero. We have 
( Up -Ip ) 2 = f -i{lp, Ip] - Ip 2 

= -k 2 - i{^, Y}k„D v - rYD^D u . (77.25) 

Next we use 7 M 7 I/ = ^({7^,7^} + [7^,7^]) = —g yu — 2iS y,v to get 

{Up - If ) 2 = -k 2 + 2 ik-D + D 2 + 2 iS^D^Dv . (77.26) 


In the last term, we can use the antisymmetry of S 111 ' to replace D^Dy with 
= —7}igF^ w , which yields 

{Up - Ip) 2 = - k 2 + 2 ik-D + D 2 + gS^F^ . (77.27) 


We use eq. (77.27) in eq. (77.24), and then rescale k by Af; the result is 


6 4 {x-y) -> M 4 J 


d 4 k 

(2 W 


e iMk(x-y) £ -k 2 e 2ik D/M+D 2 /Ap+gS^F^/M 2 


(77.28) 


Thus we have 


Tr 5 4 {x— x )75 



^ p -k 2 r V 2ik-D/M+D 2 /M 2 +gS^F^/M 2 
(2tt) 4 75 


(77.29) 

We can now expand the exponential in inverse powers of Af ; only terms up 
to Af -4 will survive the Af — > 00 limit. Furthermore, the trace over spin 
indices will vanish unless there are four or more gamma matrices multiply- 
ing 75. Together, these considerations imply that the only term that can 
make a nonzero contribution is ^{gS^F^f) 2 / M 4 . Thus we find 


Tr d 4 (x— x )75 -> \g 2 


d 4 k 


e- fc2 (TrF^F pCT )(Tr ^5^75) , 


(77.30) 


where the first trace is over group indices (in the nonabelian case), and the 
second trace is over spin indices. The spin trace is 


Tr 5^5^75 = Tr(|7^)(| 7 V)75 

= — -jTr 7^7 i/ 7 P 7 <T 75 

= i£^ pa . 


(77.31) 
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To evaluate the integral over k in eq. (77.30), we analytically continue 
to euclidean spacetime; this results in an overall factor of i, as usual. Then 
each of the four gaussian integrals gives a factor of 7 r 1 / 2 . So we find 

Tr 5 4 (s-x)ts - - £ pupa TV F pu F p<J . (77.32) 

Using this in eq. (77.19), we get 


(det J) 2 = exp 


167T 2 


J d A x a(x) £ pupa Tr F pv (x)F pa (x) 


(77.33) 


Including the transformation of the measure, eq. (77.18), in the transfor- 
mation of the path integral, eq. (77.16), then yields 


Z(A) 




0 iS(A) 


i f dlx a(x)\(g 2 /mTT 2 )e^ v P a Tr F^ v (x)F pa {x)+d^j l ^{x)\ 


(77.34) 

in place of eq. (77.16). This must be equal to the original expression for 
Z(A), eq. (77.8). This implies that eq. (77.7) holds inside quantum correla- 
tion functions, up to possible contact terms. 

Note that this derivation of eq. (77.7) did not rely on an expansion in 
powers of g, and so eq. (77.7) is exact; there are no higher-order corrections. 
This result is known as the Adler-Bardeen theorem. It can also be (and 
originally was) established by a careful study of Feynman diagrams. 

The Fujikawa method can be used to find the anomaly in the chiral 
gauge theories that we studied in section 75, but the analysis is more in- 
volved. Here we will quote only the final result. 

Consider a left-handed Weyl held in a (possibly reducible) represen- 
tation R of the gauge group. We define the chiral gauge current j ap = 
Its covariant divergence (which should be zero, according to 
Noether’s theorem) is given by 


Dfj b > 1 = 


^e^d p Tr 


T£(A u d p A a - \igA v A p A 0 


(77.35) 


Note that the right-hand side of eq. (77.35) is not gauge invariant. The 
anomaly spoils gauge invariance in chiral gauge theories, unless this right- 
hand side happens to vanish for group-theoretic reasons. We show in prob- 
lem 77.1 that this occurs if and only if H(R) = 0, where H(R) is the anomaly 
coefficient of the representation R. 

For comparison, note that eq. (77.7) can be written as 


<9/Ua = 




A v d p A 0 


^igA I/ A p A cr 


(77.36) 
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The relative value of the overall numerical prefactor in eqs. (77.35) and 
(77.36) is easy to understand: there is a minus one-half in eq. (77.35) from 
P h — * —^ 75 , and a one-third from regularizing to preserve symmetry among 
the three external lines in the triangle diagram. (The relative coefficients 
of the second terms have no comparably simple explanation.) 

Finally, a related but more subtle problem, known as a global anomaly, 
arises for theories with an odd number of Weyl fields in a pseudoreal rep- 
resentation, such as the fundamental representation of SU(2). In this 
case, every gauge field configuration A M can be smoothly deformed into 
another gauge field configuration A ' that has the same action, but has 
Z(A') = — Z(A). Thus, when we integrate over A, the contribution from 
A' cancels the contribution from A, and the result is zero. Since its path 
integral is trivial, this theory does not exist. 

Problems 

77.1) Show that the right-hand side of eq. (77.35) vanishes if and only if 
A(R) = 0. 

77.2) Show that the right-hand side of eq. (77.36) equals the right-hand side 
of eq. (77.7). 
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78 Background Field Gauge 

Prerequisite: 73 


In the section, we will introduce a clever choice of gauge, background field 
gauge, that greatly simplifies the calculation of the beta function for Yang- 
Mills theory, especially at the one-loop level. 

We begin with the lagrangian for Yang-Mills theory, 

C YM = -\F a ^F%, (78.1) 

where the field strength is 

F; v = d»A a v - d„A» + gf^Af . (78.2) 


To evaluate the path integral, we must choose a gauge. As we saw in section 
71, one large class of gauges corresponds to choosing a gauge-fixing function 
G a (x), and adding £ g f + £ g h to £ Y m, where 

£ g f = -\C 1 G a G a , (78.3) 


f)G a 

r , = c a —— n bc c c 

gh 8A b ^ 


(78.4) 


Here D b ff = 5 bc d M — ig(T“) fec A“ = 5 bc d fJ + gf bac Af is the covariant derivative 
in the adjoint representation, and c and c are the ghost and antighost fields. 
The notation dG a /dA b t means that any derivatives that act on Ajj in G a 
now act to the right in eq. (78.4). 

We get gauge by choosing G a = <9U4“. To get background field 
gauge , we first introduce a fixed, classical background field A“(x), and the 
corresponding background covariant derivative, 


D, = d, - igTlAl . (78.5) 

Then we choose 

G a = (Duyb^A-A) 1 ^ . (78.6) 

The ghost lagrangian becomes £ g h = F'D liab D b fc c , or, after an integration 
by parts, 

£ gh = -(D»c) a (D lx c) a , (78.7) 

where (I>c) a = D^ ab c b and (D^c) a = D“ c c c . 

Under an infinitesimal gauge transformation, the change in the fields is 


5 g A“(x) = -d; c o c (x) , 

(78.8) 

S G c b (x) = -i 9 e a {x){T a A ) bc c c {x) , 

(78.9) 

Sg-A^x) = 0 . 

(78.10) 
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The antighost c transforms in the same way as c (since the adjoint repre- 
sentation is real). The background field A is fixed, and so does not change 
under a gauge transformation. Of course, this means that £ g f and £ g h are 
not gauge invariant; their role is to fix the gauge. 

We can, however, define a background field gauge transformation , under 
which only the background field transforms, 


6 bg a;(x) = -d^9 c (x) , 

(78.11) 

^BG^^) = 0 , 

(78.12) 

(5 bg c ( x ) = 0 . 

(78.13) 


Obviously, £ YM is invariant under this transformation (since it does not 
involve the background held at all), but £ g f and £ g h are not. However, 
£ g f and £ g h are invariant under the combined transformation <5 G+BG . For 
£ g h, as given by eq. (78.7), this follows immediately from the fact that 
and have the same transformation property under the combined 
transformation, and that using covariant derivatives with all group indices 
contracted always yields a gauge-invariant expression. 

To use this argument on £ g f, as given by eqs. (78.3) and (78.6), we need 
to show that (A — A)f transforms under the combined transformation in 
the same way as does an ordinary held in the adjoint representation, such 
as the ghost held in eq. (78.9). To show this, we write 

<5 g+bg (H-H)^ = -(. D-D) b *e a 

= + ig(A-A)l(Ti) ba 9 a 

= -ige a (T2) bc (A - A)^ . (78.14) 

We used the complete antisymmetry of (Tf) ba = —if cba to get the last line. 
We see that {A — H)“ transforms like an ordinary held in the adjoint rep- 
resentation, and so any expression that involves only covariant derivatives 
(either D M or D acting on this held, with all group indices contracted, is 
invariant under the combined transformation. 

Therefore, the complete lagrangian, C = £ YM + £ g f + £ g h? invariant 
under the combined transformation. 

Now consider constructing the quantum action T(H, c, c; A). Recall from 
section 21 that the quantum action can be expressed as the sum of all 
1PI diagrams, with the external propagators replaced by the corresponding 
helds. In a gauge theory, the quantum action is in general not gauge 
invariant, because we had to hx a gauge in order to carry out the path 
integral. The quantum action thus depends on the choice of gauge, and 
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hence (in the case of background field gauge) on the background field A. 
This is why we have written A as an argument of T, but separated by a 
semicolon to indicate its special role. 

An important property of the quantum action is that it inherits all lin- 
ear symmetries of the classical action; see problem 21.2. In the present 
case, these symmetries include the combined gauge transformation <5 G+BG . 
Therefore, the quantum action is also invariant under the combined trans- 
formation. The quantum action takes its simplest form if we set the exter- 
nal field A equal to the background field A. Then, T(A, c, c; A) is invariant 
under a gauge transformation of the form 

<5g+bg A^x) = — D^ c 9 c (x) , (78.15) 

5 g+ bgc\x) = - igd a (x)(TZ) bc c c (x ) . (78.16) 

This is now simply an ordinary gauge transformation, with A as the gauge 
field. 

The quantum action can be expressed as the classical action, plus loop 
corrections. For A = A, we have 

T(A, c, c;A) = f A [4 F a ^F* w - (D> i c) a (D fi c) a ]+..., (78.17) 

where the ellipses stand for the loop corrections. Note that £ g f has dis- 
appeared [because we set A = A in eq. (78.6)], and C g h has the form of a 
kinetic term for a complex scalar field in the adjoint representation. This 
term is therefore manifestly gauge invariant, as is the FF term. 

The gauge invariance of the quantum action has an important conse- 
quence for the loop corrections. In background field gauge, the renormal- 
izing Z factors must respect the gauge invariance of the quantum action. 
Therefore, using the notation of section 73, we must have 

Zi = Z 2 , (78.18) 

Z v = Z 2 , , (78.19) 

Z 3 = Z 3g = Z 3g . (78.20) 

Thus the relation between the bare and renormalized gauge couplings be- 
comes 

gl = Zf\j 2 fj £ . (78.21) 

This relation now involves only Z 3 . We can therefore compute the beta 
function from Z 3 alone. This is the major advantage of background field 
gauge. 
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To compute the loop corrections, we need to evaluate 1PI diagrams in 
background-field gauge with the external propagators removed and replaced 
with external fields; the external gauge field should be set equal to the 
background field. The easiest way to do this is to set 

A = A + A (78.22) 

at the beginning, and to write the path integral in terms of A. Then the A 
field appears only on internal lines, and the A field only on external lines. 
The gauge-fixing term now reads 

£ g f = -ir 1 ^) 8 ^) 8 , (78.23) 

and the ghost term is given by eq. (78.7). 

The Feynman rules that follow from C Y M + £ g f + £ g h are closely related 
to those we found in R £ gauge in section 72. The ghost and gluon prop- 
agators are the same, and vertices involving all internal lines are also the 
same. But if one or more gluon lines are external, then there are additional 
contributions to the vertices from £ g f and £ g h- We leave the details to 
problem 78.1. 

Further simplifications arise at the one-loop level. Using eq. (78.22) in 
eq. (78.2), we find 

C = - cU£ + gr h % Al 

+ d,A a u - d v A% + gf abc (A b ,A c v + A\ l A c u ) + gf abc A b ,Al 

= F%, + (D^r - (A,A*)° + 9f abc A\Al • (78.24) 

We then have 

£ym = -\F a ^F; u - ±(^MT(A^r + U^ATiDuA^r 

- y/^F^A^Al + . . . , (78.25) 

where the ellipses stand for terms that are linear, cubic, or quartic in A. 
Vertices arising from terms linear in A cannot appear in a 1PI diagram, and 
the cubic and quartic vertices do not appear in the one-loop contribution 
to the A propagator. 

The last term on the first line of eq. (78.25) can be usefully manipulated 
with some dummy-index relabelings and integrations by parts; we have 

{D lx A v ) a (D v A lx ) a = (D v Ap) c (D»A v ) c 
= -A b yD"D») bc Al 
= -A b ^D^D u - [D^ ,D u ]) bc A c u 
= -A\[D^DA bc Al - ig{T“) bc F a ^A\ L A c u 
= +(D^A fl ) c (D u A u ) c - gf abc F atlv AlA c v . (78.26) 
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Figure 78.1: The one-loop contributions to the A propagator in background 
field gauge; the dashed lines can be either ghosts or internal A gauge Helds. 
The dot denotes the FAA vertex. 

Now the first term on the right-hand side of eq. (78.26) has the same form 
as the gauge- fixing term. If we choose £ = 1, these two terms will cancel. 

Setting £ = 1, and including a renormalizing factor of Z 3 , the terms of 
interest in the complete lagrangian become 

C = -\Z Z F^F; V - ±Z 3 (D^AT(D,Au) a - (D»c) a {D^c) a 

- Z 3 gf abc F atlv A b ^A c u . (78.27) 

In the ghost term, we have replaced D ^ with the vertex corresponding 
to the dropped A term does not appear in the one-loop contribution to 
the A propagator. Also, we can rescale A to absorb Z 3 in all terms except 
the first; since A never appears on an external line, its normalization is 
irrelevant, and always cancels among propagators and vertices. (The same 
is true of the ghost field.) 

The one-loop diagrams that contribute to the A propagator are shown in 
fig. (78.1). The dashed lines in the first two diagrams represent either the A 
field or the ghost fields. In either case, the second diagram vanishes, because 
it is proportional to / which is zero after dimensional regularization. 

Note that the ghost term in eq. (78.27) has the form of a kinetic term 
for a complex scalar field in the adjoint representation. In problem 73.1, 
we found the contribution of a complex scalar field in a representation R cs 
to II(£; 2 ) is 

a 2 1 

n cs (fe 2 ) = — 24^2 T ( R cs) — + finite . (78.28) 

The ghost contribution is minus this, with R cs — ► A. (The minus sign is 
from the closed ghost loop.) Thus we have 

n g h(A: 2 ) = + 2^.2 ~ + fi R it e + 0(g 4 ) . (78.29) 

For reference we recall that the counterterm contribution is 

n ct (fc 2 ) = -(z 3 -i) . 


(78.30) 
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Next we consider the diagrams with A fields in the loop. If the FAA 
interaction term was absent, the calculation would again be a familiar one; 
the DAD A term in eq. (78.27) has the form of a kinetic term for a real 
scalar held that carries an extra index u. That this index is a Lorentz 
vector index is immaterial for the diagrammatic calculation; the index is 
simply summed around the loop, yielding an extra factor of d = 4. There 
is also an extra factor of one-half (relative to the case of a complex scalar) 
because A is real rather than complex. (Equivalently, the diagram has a 
symmetry factor of S = 2 from exchange of the top and bottom internal 
propagators when they do not carry charge arrows.) We thus have 

n DADAik 2 ) = T(A) - £ + finite + 0(g 4 ) . (78.31) 

If we now include the FAA interaction, we can think of F^ u as a constant 
external held. We can then draw the third diagram of hg. (78.1), where and 
each dot denotes a vertex factor of —2 igf abc F^ u . This vacuum diagram has 
a symmetry factor of S = 2 x 2: one factor of two for exchanging the top 
and bottom propagators, and one for exchanging the left and right sources. 
Its contribution to the quantum action is 

faa/VT = |(-2 

= g 2 T(A)F a ^F« v + finite) , (78.32) 

where VT is the volume of spacetime. Comparing this with the tree-level 
lagrangian -jZ-^FF, and recalling eq. (78.30), we see that eq. (78.32) is 
equivalent to a contribution to II(fc 2 ) of 

a 2 1 

II p AA { k 2 ) = T(A) - + finite . (78.33) 

There is also a one-loop diagram with one DADA vertex and one FAA 
vertex; however, contracting the vector indices on the A helds around the 
loop leads to a factor of F^g^ = 0. Similarly, a one- loop diagram with a 
single FAA vertex vanishes. 

We could also couple the gauge held to a Dirac fermion in the rep- 
resentation R df , and a complex scalar in the representation R cs . The 
corresponding contributions to II (k 2 ) were computed in section 73, and are 
given by eq. (78.28) and 

q 2 1 

= x o T ( R df) - + finite , 

07T Z £ 


IIdf^ 2 ) 


(78.34) 
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Adding up eqs. (78.28), (78.29), (78.30), (78.31), (78.33), and (78.34), we 
find that finiteness of II(/c 2 ) requires 


■^3 — 1 + 


9 _ - 
247T 2 - 


(+1-2 + 12) T( A) - 4T(R df ) - T(Rcs)] - + 0(g 4 ) (78.35) 


in the MS renormalization scheme. 

The analysis of section 28 now results in a beta function of 


P(g) = ~ 


9 

48vr 2 L 


11T(A) - 4T(R dp ) - T(R CS ) + 0(g 5 ) . 


(78.36) 


A Majorana fermion or a Weyl fermion makes half the contribution of a 
Dirac fermion in the same representation; a real scalar field makes half the 
contribution of a complex scalar field. (Majorana fermions and real scalars 
must be in real representations of the gauge group.) 

In quantum chromodynamics, the gauge group is SU(3), and there are 
n F = 6 flavors of quarks (which are Dirac fermions) in the fundamental 
representation. We therefore have T(A) = 3, T(R DF ) = ^n F , and T(R CS ) = 
0; therefore 

0(9) = ( n - I n *) + °(9 5 ) ' (78-37) 

We see that the beta function is negative for n F < 16, and so QCD is 
asymptotically free. 


Problems 

78.1) Compute the tree- level vertex factors in background field gauge for all 
vertices that connect one or more external gluons with two or more 
internal lines (ghost or gluon). 

78.2) Our one-loop corrections can be interpreted as functional determi- 
nants. Define 

□r ,( a,6) = D 2 + gT«F; v S^ h) , (78.38) 

where D ^ = <9 M — ig(T“)A c ^ is the background-covariant derivative in 
the representation R, implicitly multiplied by the indentity matrix 
for the (a, b ) representation of the Lorentz group, and are the 

Lorentz generators for that representation; in particular, 

S (U) = 0 • 

^(2,i)e(i,2) = 1 1 ] > 

( S (2,2)U = -iWp - S v a S^) . 


(78.39) 

(78.40) 

(78.41) 
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Show that the one-loop contribution to the terms in the quantum 
action that do not depend on the ghost fields is given by 

exp ?T 1 _ loop (^4, 0 , 0; A) oc (det n Ai(lj i)) +1 

x(det n A (2)2 )) _1/2 
x (det □r DFi ( 2 ,i)©(i, 2 ) ) +1//2 
x(detn Rcs>(1 , 1) )- 1 . (78.42) 

Verify that this expression agrees with the diagrammatic analysis in 
this section. 
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79 Gervais-Neveu Gauge 

Prerequisite: 78 

In section 78, we used background field gauge to set up the computation 
of a quantum action that is gauge invariant. Given this quantum action, 
we can use it to compute scattering amplitudes via the corresponding tree 
diagrams, as discussed in section 19. Since the ghost fields in the quantum 
action do not contribute to tree diagrams, we can simply drop all the ghost 
terms in the quantum action. 

Because the quantum action computed in background field gauge is 
itself gauge invariant, it requires further gauge fixing to specify the gluon 
propagator and vertices. We can choose whatever gauge is most convenient; 
for example, R '.g gauge. In principle, this gauge fixing involves introducing 
new ghost fields, but, once again, these do not contribute to tree diagrams, 
and so we can ignore them. 

If we start with the tree-level approximation to the quantum action, 
then, in R £ gauge, we simply get the gluon propagator and vertices of 
section 72. As we noted there, the complexity of the three- and four-gluon 
vertices in R ^ gauge leads to long, involved computations of even simple 
processes like gluon-gluon scattering. 

In this section, we will introduce another gauge, Gervais-Neveu gauge , 
that simplifies these tree-level computations. 

We begin by specializing to the gauge group SU(IV), and working with 
the matrix- valued field A ^ = A a ^T a . For later convenience, we will normal- 
ize the generators via 

TV T a T b = S ab . (79.1) 

With this choice, their commutation relations become 

[T a , T b ] = iy/2f abc T c . (79.2) 

The tree-level action is specified by the Yang-Mills lagrangian, 

jO YM = -lTrF^F^, (79.3) 

where the matrix-valued field strength is 

G, = GG - - %[A^Au\ ■ (79.4) 

Let us introduce the matrix- valued complex tensor 

= d„A v - %\A U . (79.5) 

Then F fll , is the antisymmetric part of H /w , 

R\iv — N i iv 77 //// • 


(79.6) 



79: Gervais-Neveu Gauge 


474 


The Yang-Mills lagrangian can now be written as 

C YM = . (79.7) 

To fix the gauge, we choose a matrix- valued gauge-fixing function G(x), 
and add 

£ g f = -±TrGG (79.8) 

to £ ym . Here we have set the gauge parameter £ to one, and ignored the 
ghost lagrangian (since, as we have already discussed, it does not affect tree 
diagrams). The choice of G that yields Gervais-Neveu gauge is 

G = 77% . (79.9) 

At first glance, this choice seems untenable, because we see from eq. (79.5) 
that this G (and hence £ g f) is not hermitian. However, because the role of 
£gf is merely to fix the gauge, it is acceptable for £ g f to be nonhermitian. 
Combining eqs. (79.7) and (79.8), we get a total, gauge-fixed lagrangian 

C = -±TY + 7T%i7%) . (79.10) 

Consider the terms in £ with two derivatives. After some integrations by 

parts, those from the third term in eq. (79.10) cancel those from the second, 
leading to 

C 2d = -\Tr& t A v d, x A Vi (79.11) 

just as in gauge with £ = 1. Now consider the terms with no derivatives. 
Once again, those from the third term in eq. (79.10) cancel those from the 
second (after using the cyclic property of the trace), leading to 

£ 0 a = +\g 2 Tr A l ‘A l 'A ll A v . (79.12) 

Finally, we have the terms with one derivative, 

£ id = +% Tr ( y d l, A v A l jA v - 8WA U A^ + 8^A V A U ) . (79.13) 

Each derivative acts only on the held to its immediate right. If we integrate 
by parts in the last term in eq. (79.13), we generate two terms; one of these 
cancels the first term in eq. (79.13), and the other duplicates the second. 
Thus we have 

C ia = - iV2g Tr 8 l A v A v A, l ■ (79.14) 

Combining eqs. (79.11), (79.12), and (79.14), we find 

£ = Tr (-\8^8, l Av - iV2g 8»A v A„Ap + \g 2 AWA^) . (79.15) 
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Because this lagrangian has a rather simple structure in terms of the matrix- 
valued field Ap, it is helpful to stick with this notation, rather than trying 
to reexpress C in terms of = Tr (TM^). In the next section, we explore 
the Feynman rules for a matrix- valued field in a simplified context. 

Reference Notes 

Gervais-Neveu gauge and some interesting variations are discussed in Siegel. 
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80 The Feynman Rules for N x N Matrix 

Fields 

Prerequisite: 10 

In section 79, we found that the lagrangian for SU(IV) Yang-Mills theory 
in Gervais-Neveu gauge is 

C = Tr(-\& t A v d, l A v - iV2g &‘A v A v A i , + \g 2 A^A^aJ) , (80.1) 

where A^fx) is a traceless hermitian N x N matrix. In this section, we will 
work out the Feynman rules for a simplified model of a scalar field that 
keeps the essence of the matrix structure. 

Let B{x) be a hermitian N x N matrix that is not traceless. Let T a be 
a complete set of N 2 hermitian N x N matrices normalized according to 

Tr T a T b = 5 ab . (80.2) 

We will take one of these matrices, T n2 , to be proportional to the identity 
matrix; then eq. (80.2) requires the rest of the T a ’s to be traceless. We can 
expand B(x) in the T a ’s, with coefficient fields B a (x), 

B(x) = B a (x)T a , (80.3) 

B a {x) = Tr T a B{x) , (80.4) 

where the repeated index in eq. (80.3) is implicitly summed over a = 1 to 
N 2 . 

Consider a lagrangian for B{x) of the form 

C = Tr (-±&*BdpB + ±gB 3 - . (80.5) 

Using eqs. (80.2) and (80.3), we find an expression for C in terms of the 
coefficient fields, 

C = -\d^B a d^B a + \gTr(T a T h T c )B a B b B c 

-\\Tr(T a T b T c T d )B a B b B c B d . (80.6) 

It is easy to read off the Feynman rules from this form of C. The propagator 
for the coefficient field B a is 

rafe 

A ab {k 2 ) = -T . (80.7) 

k z — ie 

There is a three-point vertex with vertex factor 2 ig Tr (T a T b T c ), and a four- 
point vertex with vertex factor —6i\Tr(T a T b T c T d ). This clearly leads to 
messy and complicated formulae for scattering amplitudes. 
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Figure 80.1: The double-line notation for the propagator of a hermitian 
matrix field. 




Figure 80.2: 3- and 4-point vertices in the double-line notation. 


Instead, let us work with C in the form of eq. (80.5). Writing the matrix 
indices explicitly, with one up and one down (and employing the rule that 
two indices can be contracted only if one is up and one is down), we have 
B(x)i 3 = B a (x){T a )p . This implies that the propagator for Bp is 


A i j k l (k 2 ) 


C T a )P(T a ) k l 
k 2 — ie 


(80.8) 


Since the T a matrices form a complete set, there is a completeness relation 
of the form {T a )p (T a ) k l oc 5/5^- To get the constant of proportionality, set 
j=k and l—i to turn the left-hand side into {T a )i k (T a ) k = Tr (T a T a ), and 
the right-hand side into 5i l 5k k = A 2 . From eq. (80.2) we have Tr (T a T a ) = 
5 aa = N 2 . So the constant of proportionality is one, and 


C T a )i j (T a )k l = 6i l 6 k i . 


(80.9) 


We can represent the B propagator with a double-line notation, as 
shown in fig. (80.1). The arrow on each line points from an up index to 
a down index. Since the interactions are simple matrix products, with an 
up index from one field contracted with a down index from an adjacent 
field, the vertices follow the pattern shown in fig. (80.2). Since an n-point 
vertex of this type has only an n-fold cyclic symmetry (rather than an n\- 
fold permutation symmetry), the vertex factor is i times the coefficient of 
Tr (B n ) in C times n (rather than ?r!). Thus, for the lagrangian of eq. (80.5), 
the 3- and 4-point vertex factors are ig and — i\. 

Now consider a scattering process. Particles corresponding to the coef- 
ficient fields labeled by the indices a\ and 02 (and with four-momenta k\ 
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Figure 80.3: Tree diagrams with four external lines. Five more diagrams 
of each of these three types, with the external labels 2, 3, and 4 permuted, 
also contribute. 


and k 2 ) scatter into particles corresponding to the coefficient fields labeled 
by the indices 03 and 04 (and with four-momenta k 3 and /c'4). We wish to 
compute the scattering amplitude for this process, at tree level. 

There are 18 contributing Feynman diagrams. Three are shown in 
fig. (80.3); the remaining 15 are obtained by making noncylic permutations 
of the labels 1,2, 3, 4 (equivalent to making unrestricted permutations of 
2,3,4). For simplicity, we will treat all external momenta as outgoing; 
then ki and k® are negative, and k\ + k 2 + &3 + &4 = 0. Each external 
line carries a factor of T a \ with its matrix indices contracted by following 
the arrows backward through the diagrams. Omitting the ie’s in the prop- 
agators (which are not relevant for tree diagrams), the resulting tree- level 
amplitude is 


iT 


Tr (T ai T“ 2 T 0,3 T“ 4 ) 


/ (zg) 2 H) (igf(-i) 
\{ki+k 2 ) 2 (fci+fei) 2 



+ ((234) -»• (342), (423), (243), (432), (324)) . (80.10) 

More generally, we can see that the value of any tree-level diagram with 
n external lines is proportional to Tr(T a n . . . T ain ). If the diagram is drawn 
in planar fashion (that is, with no crossed lines), then the ordering of the 
di indices in the trace is determined by the cyclic ordering of the labels 
on the external lines (which we take to be couterclockwise) . Then, each 
internal line contributes a factor of — i/k 2 , each 3-point vertex a factor of 
ig, and each four-point vertex a factor of — iX. These are the color-ordered 
Feynman rules for this theory. 

Return now to iT as given by eq. (80.10). Suppose that we wish to 
square this amplitude, and sum over all possible particle types for each 
incoming or outgoing particle. We then have to evaluate expressions like 


Tr(T ai T a2 T“ 3 T a4 ) [Tr(T ai r [' a2r [' a ^ r [' a3 


(80.11) 
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Figure 80.4: Evaluation of Tr(T ai T a2 T a3 T a4 )[Tr(T ai T a2 T a4 T a3 )]* , with all 
repeated indices summed. Each of the two closed single-line loops yields a 
factor of Sj l = N. 


with all repeated indices summed. Using the hermiticity of the T a matrices, 
we have 


[Tr(T ai . . . T an )]* = Tr(T a " . . . T ai ) , (80.12) 


It is then easiest to evaluate eq. (80.11) diagrammatically, as shown in 
fig. (80.4). Each closed single-line loop yields a factor of Si 1 = N. The 
result is that the absolute square of any particular trace yields a factor of 
N 4 , and the product of any trace times the complex conjugate of any other 
different trace yields a factor of N 2 . 

The coefficient of both Tr(T ai T a2 T a3 T a4 ) and Tr(T ai T a4 T a3 T a2 ) in 
eq. (80.10) is 


( ki+k 2 ) 2 + (Aq+fct ) 2 


(80.13) 


Similarly, the coefficient of both Tr(T ai T a3 T a4 T a2 ) and Tr(T ai T a2 T a4 T a3 ) 
is 


A 4 


(. h+h ) 2 + (h +k 2 ) 2 


(80.14) 


and of both Tr(T ai T a4 T a2 T a3 ) and Tr(T ai T a3 T a2 T a4 ) is 


2 (ki+ki) 2 + (ki+k3) 2 

Thus we have 


(80.15) 


53 \t\ 2 = ( 2n4 + 2n2 ) J2 \ a j \ 2 + 4iy2 53 A ] A k 

0,1,0,2,0,3,114 j j^k 


= (21V 4 -21V 2 ) 53 I A,- 1 2 + 41V 2 (53^*) (53 A fc ) , (80.16) 

3 3 k 


where j and k are summed over 2, 3,4. 
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j k N j k 


Figure 80.5: The propagator for a traceless hermitian field. 

Now suppose we wish to impose the condition that the matrix field B 
is traceless: Tr B = 0. This means that we eliminate the component field 
with a=N 2 , corresponding to the matrix T n2 = A -1 / 2 /. We must also 
eliminate T N from the sum in eq. (80.9), leading to 

(T a )^(T a ) k l = 5i l 6 k i - ±8i j 5 k l . (80.17) 

This can all be done diagrammatically by replacing the propagator in 
fig. (80.1) with the one in fig. (80.5). (The kinematic factor, —i/k 2 , is un- 
changed.) Fig. (80.5) must now be used as the internal propagator in the 
diagrams of fig. (80.3). Also, when we multiply one diagram by the complex 
conjugate of another in the computation of J2ai...a n |^| 2 > we must use the 
propagator of fig. (80.5) to connect the external line of one diagram with 
the matching external line of the complex conjugate diagram. Although 
these computations are still straightforward, they can become considerably 
more involved. 

Problems 

80.1) Show that the color-ordered Feynman rules, and the rules for compo- 
nent fields given after eq. (80.6), agree in the case N = 1. 

80.2) Verify the results quoted after eq. (80.12). 

80.3) Compute X)ai ,a 2 ,a 3 ,a 4 \Tr(T ai T a2 T a3 T a4 )\ 2 for the case of traceless 
T a, s. 

80.4) The large- N limit. Let A = eg 2 , where c is a number of order one. Now 
consider evaluating the path integral, without sources, as a function 
of g and N, 

Z(g,N ) = e iW{9 ’ N) = J VBe i S ddxC , (80.18) 

where W(g,N ) is normalized by VF(0, N) = 0. As usual, W can be 
expressed as a sum of connected vacuum diagrams, which we draw 
in the double-line notation. Consider a diagram with V 3 three-point 
vertices, V 4 four-point vertices, E propagators or edges, and F closed 
single-line loops or faces. 
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a) Find the dependence on g and N of a diagram specified by the 
values of V 3 , V4, E, and F. 

b) Express E for a vacuum diagram in terms of V 3 and V4. 

c) Recall, derive, or look up the formula for the Euler character x of 
the two-dimensional surface of a polyhedron in terms of the values of 
V = V;s + V4, E, and F. The Euler character is related to the genus 
Q of the surface by x = 2 — 2Q-, Q counts the number of handles, so 
that a sphere has genus zero, a torus has genus one, etc. 

d) Consider the limit g — ► 0 and N — > 00 with the ’t Hooft coupling 
A = g 2 N held fixed (and not necessarily small). Show that W(A, N) 
has a topological expansion of the form 

OO 

W{ A, N) = J2 N 2 ~ 2G Wg( A) , (80.19) 

Q = 0 

where Wg (A) is given by a sum over diagrams that form polyhedra 
with genus Q. In particular, the leading term, I'Fo(A), is given by 
a sum over diagrams with spherical topology, also known as planar 
diagrams. 
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81 Scattering in Quantum Chromodynamics 

Prerequisite: 60, 79, 80 

In section 79, we found that the lagrangian for SU(IV) Yang-Mills theory 
in Gervais-Neveu gauge is 

£ = Tt(— - iV2g d^A u A u A p + A'A v A p A v ) , (81.1) 

where A p (x) is a traceless hermitian N x N matrix. For quantum chromo- 
dynamics, N = 3, but we will leave N unspecified in our calculations. In 
section 80, we worked out the color-ordered Feynman rules for a scalar ma- 
trix field; the same technology applies here as well. In particular, we draw 
each tree diagram in planar fashion (that is, with no crossed lines). Then 
the cyclic, counterclockwise ordering i\ . . . i n of the external lines fixes the 
color factor as Tr(T ai i . . .T ain ), where the generator matrices are normal- 
ized via TV (T a T b ) = 5 ab . The tree- level n-gluon scattering amplitude is 
then written as 

T = g n ~ 2 J2 Tr (T ai . . . T a ")A( 1, . . . , n) , (81.2) 

noncyclic 

perms 

where we have pulled out the coupling constant dependence, and A(l , . . . , n) 
is a partial amplitude that we compute with the color-ordered Feynman 
rules. The partial amplitudes are cyclically symmetric, 

A(2,...,n,l) =A(l,2,...,n) . (81.3) 

The sum in eq. (81.2) is over all noncyclic permutations of 1 ... n, which is 
equivalent to a sum over all permutations of 2 ... n. 

From the first term in eq. (81.1), we see that the gluon propagator is 
simply 

A„„(k) = . (81.4) 

Here we have left out the matrix indices since we have already accounted 
for them with the color factor in eq. (81.2). The second and third terms 
in eq. (81.1) yield three- and four-gluon vertices. The three- gluon vertex 
factor (again without the matrix indices) is 

™ P v P {j>,q,r) = i{-iV2g)(-ip p gp „ ) 

+ [2 cyclic permutations of (fr,p), (v,q), ( p,r ) ] 

= -iV2g(p p g pu + q p g up + r u g pp ) , 


(81.5) 
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where the four-momenta p, q, and r are all taken to be outgoing. The 
four-gluon vertex factor is simply 

fu / per — dj fj/rpfjlsa ■ (81.6) 


However, in the context of the color-ordered rules, it is simpler to designate 
the outgoing four-momentum on each external line as k{, and contract the 
vector index with the corresponding polarization vector £j. (For now we 
suppress the helicity label A = ±.) In this notation, the vertex factors 
become 


iVi23 = -iV2g\{£i£ 2 )(ki£ 3 ) + + (£3£i){h^2) 


(81.7) 


*Vi234 — +ig 2 (,£i£3)(,£2£a) , 


(81.8) 


where the external lines are numbered sequentially, counterclockwise around 
the vertex. (Of course, if an attached line is internal, the corresponding 
polarization vector is simply a placeholder for an internal propagator.) 

The color-ordered three-point vertex, eq. (81.7), is antisymmetric on the 
reversal 123 321, while the four-point vertex, eq. (81.8), is symmetric on 

the reversal 1234 4321. This implies the reflection identity, 


A(n , . . . , 2, 1) = (— l)VL(l,2,...,n) , (81.9) 


which will be useful later. 

It is clear from eqs. (81.7) and (81.8) that every term in any tree-level 
scattering amplitude is proportional to products of polarization vectors 
with each other, or with external momenta. (Actually, this follows directly 
from Lorentz invariance, and the fact that the scattering amplitude is linear 
in each polarization.) We get one momentum factor from each three-point 
vertex. Since every tree diagram with n external lines has no more than 
n— 2 vertices, there are no more than n — 2 momenta to contract with the 
n polarizations. Therefore, every term in every tree-level amplitude must 
include at least one product of two polarization vectors. Then, if the prod- 
uct of every possible pair of polarization vectors vanishes, the tree-level 
amplitude for that process is zero. 

We will now show that this is indeed the case if all, or all but one, of 
the external gluons have the same helicity. (Here we are using the seman- 
tic convention of section 60: the helicity of an external gluon is specified 
relative to the outgoing four-momentum ki that labels the corresponding 
external line. If that gluon is actually incoming — as indicated by a negative 
value of kd - — then its physical helicity is opposite to its labeled helicity.) 
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To proceed, we recall from section 60 some formulae for products of 
polarization vectors in the spinor-helicity formalism, 


£+(k;q)-£ + (k';q') 

(q q') [k k'} 
{q k) ( q' k ') 

(81.10) 

£-(k;q)-£-(k';q') 

[q q’} (k k') 

[q k ] [q' k'} ’ 

(81.11) 

£+(k;q)-£-(k';q') 

{q k') [k q'] 
(q k) [q' k '] 

(81.12) 


The first argument of each e is the momentum of the corresponding line; 
the second argument is an arbitrary reference momentum. Recall that the 
twistor producs (q k) and [q k] are antisymmetric, and hence (q q) = [qq] = 
0. Using this fact in eq. (81.10), we see that choosing the same reference 
momentum q for all positive-helicity polarizations results in a vanishing 
product for any pair of them. Furthermore, if we choose this q equal to 
the momentum k' of a negative-helicity gluon, eq. (81.12) tells us that the 
product of its polarization with that of any positive-helicity gluon also 
vanishes. Thus, if all, or all but one, of the external gluons have positive 
helicity, all possible polarization products are zero, and hence the tree-level 
scattering amplitude is also zero. Thus we have shown that 

^(l ± ,2 + ,...,n + ) = 0, (81.13) 

where the superscripts are the helicities. Of course, the same is true if all, 
or all but one, of the helicities are negative, 

^(l ± ,2“,...,n“) = 0 . (81.14) 

Now we turn to the calculation of some nonzero tree-level partial am- 
plitudes, beginning with A(\~ , 2~, 3 + , 4 + ). The contributing color-ordered 
Feynman diagrams are shown in fig. (81.1). We choose the reference mo- 
menta to be q\ = q ‘2 = k 3 and q 3 = q 4 = k 2 ■ Then all polarization products 
vanish, with the exception of 


£i-£ 4 = £-(k 1 ,q 1 )-£ + (k 4 ,q 4 ) 

= £-(k 1 ,k 3 )-£ + (k 4 ,k 2 ) 

_ (21) [4 3] 

' (24) [31] ' 


(81.15) 


With this choice of the reference momenta, the third diagram in fig. (81.1) 
obviously vanishes, because it has a factor of £i-£ 3 = 0 (and also, for good 
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Figure 81.1: Diagrams for the partial amplitude A{ 1,2, 3,4). 

measure, £ 2 -£4 = 0). Now consider the 235 vertex in the second diagram; 
we have 

V 235 OC (£ 2 £ 3 )(& 2 £ 5 ) + (£3£5)(&3£2) + (£ 5 ^ 2 ) (& 5 S 3 ) , (81.16) 

where £5 is a placeholder for an internal propagator. The first term in 
eq. (81.16) vanishes because £2 -£3 = 0. The second term vanishes because 
k 3 = q 2 and ( 72 '£2 = 0. Finally, the third term vanishes because k$ = 
— & 2 — = — 53 —^ 3 , and (/ 3 -£3 = & 3 -£3 = 0. Hence the 235 vertex vanishes, 
and therefore so does the second diagram. 

That leaves only the first diagram. We then have 

^ 2 -4(l _ ,2”,3 + ,4 + ) = (fVi25)(iV 3 4 5 /) ig fiv/ Si2 > (81.17) 

where 5' means the momentum is — k§ rather than £ 5 , and 

s 12 = -(k 1 + k 2 ) 2 = (12)[21}. (81.18) 

We have 

iVi25 = -iy/2g (£i£ 2 )(fcl£ 5 ) + (e 2 £ 5 )(k 2 £i) + {£^l){k b £ 2 ) , (81.19) 

but the first term vanishes because £i -£2 = 0. Similarly, the first term of 

iV 345 ' = -iV2g (£ 3 £4)(^’3£5) + (£4£5)(fc4£3) + (£ 5 £ 3 )(-fc 5 £ 4 ) (81.20) 

also vanishes. When we take the product of these two vertices, and replace 
the internal polarizations with the propagator, as indicated in eq. (81.17), 
only the product of the third term of eq. (81.19) with the second term 
of eq. (81.20) is nonzero; all other terms include a vanishing product of 
polarizations. We get 


ig 2 A(l ,2 ,3 + ,4 + ) = (-i\/2£f) 2 (i/si 2 )(£i£4)(A:5£2)(fe4£3) • (81.21) 
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Since k§ = — k\—k 2 , and k 2-£2 = 0, we have k$ ■ e 2 
fci-62 and /c 4-£3 via the general formulae 


p-£ + {t,q) 


(qp) [pfe] 
V2{qk) 


p-£~(k;q) 


[qp] (pk) 

v/2 [<? fc] 


Setting ^2 = k$ and <73 = k 2 , we get 


h -£2 


h -£ 3 


[ 31 ] ( 12 ) 
v/2 [3 2] ’ 

(24) [43] 
\/2 (2 3) 


-k\-£ 2 - We evaluate 


(81.22) 

(81.23) 


(81.24) 

(81.25) 


Using eqs. (81.15), (81.18), (81.24), and (81.25) in eq. (81.21), and using 
antisymmetry of the twistor products to cancel common factors, we get 


.4(l-,2-,3+,4+) 


(21) [4 3] 2 
[21] [3 2] (2 3) ' 


(81.26) 


We can make our result look nicer by multiplying the numerator and 
denominator by (34). In the numerator, we use 


(3 4) [4 3] = S34 = S12 = (1 2) [2 1] , (81.27) 


and cancel the [2 1] with the one in the denominator. Now multiply the nu- 
merator and denominator by (41), and use the momentum-conservation 
identity (see problem 60.2) to replace (41) [4 3] in the numerator with 
— (21) [2 3], and cancel the [2 3] with the [3 2] in the denominator (which 
yields a minus sign). Finally, multiply the numerator and denominator by 
(12) to get 


A( l-,2-,3+,4+) 


(1 2) 4 

(1 2) (2 3) (3 4) (4 1) ' 


(81.28) 


This is our final result for A( 1 _ , 2”, 3 + ,4 + ). 

Now, using cyclic symmetry, we can get any partial amplitude where 
the two negative helicities are adjacent; for example, 


A(l+,2-,3-,4+) 


(2 3) 4 

(1 2) (2 3) (3 4) (4 1) ' 


(81.29) 


We must still calculate one partial amplitude where the negative helicities 
are not adjacent, such as A( 1“, 2 + , 3 _ , 4 + ). Once we have it, we can use 
cyclic symmetry to get all the remaining partial amplitudes. 
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Before turning to this calculation, let us consider the problem of squar- 
ing the total amplitude and summing over colors. Because the generator 
matrices are traceless, we should (as we discussed in section 80) use the 
completeness relation 

(T a )^(T a ) k l = 5i l 5 k i - • (81.30) 

However, recall that the Yang-Mills field strength is 

F P:V = 8 P A U - d v A p - %[A^A V \ . (81.31) 

If we allow a generator matrix proportional to the identity, which corre- 
sponds to a gauge group of U(iV) rather than SU (N), then this extra U(l) 
generator commutes with every other generator. Thus the U(l) field does 
not appear in the commutator term in eq. (81.31). Since it is this commuta- 
tor term that is responsible for the interaction of the gluons, the U(l) field 
is a free held. Therefore, any scattering amplitude involving the associated 
particle (which we will call the fictitious photon) must be zero. Thus, if we 
write a scattering amplitude in the form of eq. (81.2), and replace one of 
the T a ’s with the identity matrix, the result must be zero. 

This decoupling of the fictitious photon allows us to use the much simpler 
completeness relation 

(T a )S(T a ) k l =6i l 6 k j (81.32) 

in place of eq. (81.30). There is no need to subtract the U(l) generator 
from the sum over the generators, as we did in eq. (81.30), because the 
terms involving it vanish anyway. 

The decoupling of the fictitious photon is useful in another way. Let us 
apply it to the case of n = 4, and set T“ 4 oc I in eq. (81.2). Then we have 


0 = Tr(T ai T a2 T a3 )|H(l,2,3,4) + A(l,2,4,3) + A(l,4,2,3) 

+ Tr(T ai T a3 T a2 )[H(l,3,2,4) + A(l, 3, 4, 2) + A(l, 4, 3, 2) 


(81.33) 


The contents of each square bracket must vanish. Requiring this of the first 
term yields 


71(1,2,3,4)= -A(l,2,4,3)-A(l,4,2,3) . (81.34) 

Assigning some helicities, this reads 

A(l“,2 + ,3“,4 + ) = - A(l',2 + ,4 + ,3“) - A(l",4 + ,2 + ,3“) . (81.35) 

Note that we have now expressed a partial amplitude with nonadjacent 
negative helicities in terms of partial amplitudes with adjacent negative 
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helicities, which we have already calculated. Thus we have 


A(l-,2+, 3",4+) 


(3 l ) 4 (3 l ) 4 

(3 1)(1 2) (2 4) (4 3) (31) (14) (4 2) (2 3) 


(1 3) 3 

r i ii 

(2 4) 
(1 3) 3 

L (1 2) (3 4) (14) (2 3) J 

" (1 4) (2 3) + (1 2) (3 4) " 

1-1 "to 

CO 

L (1 2)(34)(1 4) (2 3) J 
r -(13) (4 2) 1 

(2 4) 

. (1 2) (3 4) (1 4) (2 3) J ’ 


(81.36) 


where the last line follows from the Schouten identity (see problem 50.3). 
A final clean-up yields 


A(l“,2+ 3",4+) 


( 13) 4 

(1 2) (2 3) (3 4) (4 1) ' 


(81.37) 


Now that we have all the partial amplitudes, we can compute the color- 
summed \T\ 2 . There are only three partial amplitudes that are not related 
by either cyclic permutations, eq. (81.3), or reflections, eq. (81.9); we can 
take these to be 


A 3 = A(l, 2,3,4) , (81.38) 

A 4 = A( 1,3, 4, 2), (81.39) 

A 2 = A(l, 4,2,3) , (81.40) 

where the subscript on the left-hand side is the third argument on the 
right-hand side. (Switching the second and fourth arguments is equivalent 
to a reflection and a cyclic permutation, and so leaves the partial amplitude 
unchanged.) This mimics the notation we used at the end of section 80, 
and we can apply our result from there to the color sum, 

E i T i 2 = 2-/v 2 (iv 2 — l)# 4 e i^-i 2 + 4ivy (E^*)(E^) > ( 81 - 41 ) 

colors j j k 

where j and k are summed over 2, 3, 4. In the present case, however, 
eq. (81.34) is equivalent to J2jAj = 0, so the second term in eq. (81.41) 
vanishes. Our result for the color-summed squared amplitude is then 

E \T\ 2 = 2N 2 (N 2 -l)g 4 (\A 2 \ 2 + |A 3 | 2 + |A 4 | 2 ) . (81.42) 

colors 
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Figure 81.2: Color-ordered diagrams for qqgg scattering. 



Figure 81.3: The double-line version of fig. (81.2); the associated color factor 

is (T a3 T a4 ) i2 il . 

For the case where 1 and 2 are the incoming gluons, and 3 and 4 are 
the outgoing gluons, we can write this in terms of the usual Mandelstam 
variables s = s\2 = S34, t = s 1 3 = S24, and u = S14 = S23 by recalling that 
|(1 2) | 2 = | [1 2] | 2 = | s 12 1 , etc. Let us take the case where gluons 1 and 2 
have negative helicity, and 3 and 4 have positive helicity. In this case, we 
see from eqs. (81.28) and (81.37) that the numerator in every nonvanishing 
partial amplitude is (1 2) 4 . Then we get 

X l T li- 2 - 3 + 4 + = 2N 2 (N 2 -l)g 4 s 4 j + ^2 + ^ 2 ) • (81-43) 

colors x 

We can also sum over helicities. There are six patterns of two positive 

and two negative helicities; b+ and +- 1 yield a factor of .s 4 , — -| b 

and -| 1 — yield t 4 , and — +- ( — and -| b yield u 4 . The helicity sum is 

therefore 

X |T| 2 = 4N 2 (N 2 —l)g 4 (s 4 + t l +u 4 ) (-^ + ^ . (81.44) 

colors 7 

helicities 

Of course, we really want to average (rather than sum) over the initial 
colors and helicities; to do so we must divide eq. (81.44) by 4(N 2 — l) 2 . 

Next we turn to scattering of quarks and gluons. We consider a single 
type of massless quark: a Dirac field in the N representation of SU(1V). 
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The lagrangian for this field is C = where the covariant derivative 

is D fl = <9 m — ( ig/y/2)A Thus the color-ordered vertex factor is 

= (• ig/y/2 )y* . (81.45) 

To get the color factor, we use the double-line notation, with a single line 
for the quark. As an example, consider the process of qq — > (and its 

crossing-related cousins) . The contributing color-ordered tree diagrams are 
shown in fig. (81.2). The corresponding double-line diagrams are shown in 
fig. (81.3); the quark is represented by a single line, with an arrow direction 
that matches its charge arrow. To get the color factor, we start with line 2, 
and follow the arrows backwards; the result is (T a 3 T a 4 )j 2 * 1 . The complete 
amplitude can then be written as 

T = g 2 [{T as T a Q i 2 ^A(lg, 2 q , 3, 4) + A(l q , 2„ 4, 3)] , (81.46) 

where A(\ q , 2 g ,3, 4) is the appropriate partial amplitude. The subscripts q 
and q indicate the labels that correspond to an outgoing quark and outgoing 
antiquark, respectively. 

From our results for spinor electrodynamics in section 60, we know that 
a nonzero amplitude requires opposite helicities on the two ends of any 
fermion line. Consider, then, the case of T_ + \ 3 \ 4 . The partial amplitude 
corresponding to the diagrams of fig. (81.2) is 

ig 2 A( 15,2+3,4) = {ig/VWmV\h(-A/pl)m 

+ {i 9 /V2)[2\m^\ 4£l _ , ig ^ /si2 • ( 8 !- 4 7 ) 

Suppose both gluons have positive helicity. Then using 

? + {k;q) = ( \k]{q\ + \q)[k \ ) , (81.48) 

we can get both lines of eq. (81.47) to vanish by choosing (/:-> = q± = p\. 
Similarly, if both gluons have negative helicity, then using 

L(k]q) = ( \k)[q\ + \q](k\ ) , (81.49) 

we can get both lines of eq. (81.47) to vanish by choosing c /3 = qq = P 2 ■ So 
the gluons must have opposite helicities to get a nonzero amplitude. 

Consider, then, the case of A 3 = + and A 4 = — . We can get V 345 to 
vanish by choosing q% = k .4 and q± = A; 3 . The partial amplitude is then 
given by just the first line of eq. (81.47), 

A(ir,2+,3 + ,4-) = 5[2|^3 + (V 1 +^ 4 -|1)/(-s 14 ) • 


(81.50) 
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With q 3 = k 4 and q 4 = k;$, 

we have 


^3+ 

= ^|(|4)[3| + |3]<4|) , 

(81.51) 

^4“ 

= ^(WPI + fflW)- 

(81.52) 

Using the identity 

II 

1 

1 

(81.53) 

eq. (81.50) becomes 




A(ir,2+,3 + ,4'' 


[2 3] (41) [13] (41) 
(43) [34] s 14 


(81.54) 


In the numerator, we use [13] (41) = —[2 3] (4 2). In the denominator, we 
set S 14 = S 23 = (2 3) [3 2] . Then we multiply the numerator and denomina- 
tor by (12), and use [23] (12) = —[43] (14) in the numerator. Finally we 
multiply both by (14), and rearrange to get 


A(ir,2+ 3 + ,4“) 
An analogous calculation yields 

A(ir,2+,3-,4+) 


(1 4) 3 (2 4) 

(1 2) (2 3) (3 4) (4 1) ' 


(1 3 ) 3 (2 3) 

(12) (2 3) (3 4) (41) ' 


(81.55) 


(81.56) 


The remaining nonzero amplitudes are related by complex conjugation. 

Now that we have all the partial amplitudes, we can compute the color- 
summed |T| 2 . To do so, we multiply eq. (81.46) by its complex conjugate, 
and use hermiticity of the generator matrices to get 


Ei r i 2 

colors 


rjTj, rj-tb rj-ib rjnCL 


)(lVsl 2 + |A,i 


+ Tr{T a T b T a T b ) (a%A a + A* 4 A 3 ^ , 


(81.57) 


where A 3 = A( 1^, 2 g ,3, 4) and A 4 = A(lq, 2 q , 4, 3). The traces are easily 
evaluated with the double-line technique of section 80; because the ficti- 
tious photon couples to the quark, we must use eq. (81.30) to project it 
out. The traces in eq. (81.57) are also easily evaluated with the group- 
theoretic methods of section 70, with the normalization that the index of 
the fundamental representation is one: T(N) = 1. Either way, the results 


are 


Tr( T a T b T b T a) = +(JV 2 _l ) 2 /AT 
Tr(T a T b T a T b ) = -(N 2 -l)/N . 


(81.58) 

(81.59) 
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The sum over the four possible helicity patterns ( — bH — , — I b, H b, 

H 1 — ) is left as an exercise. 

Now that we have calculated these scattering amplitudes for quarks 
and gluons, an important questions arises: why did we bother to do it? 
Quarks and gluons are confined inside colorless bound states, the hadrons, 
and so apparently cannot appear as incoming and outgoing particles in a 
scattering event. 

To answer this question, suppose we collide two hadrons with a center- 
of-mass energy E = \fs large enough so that the QCD coupling g is small 
when renormalized in the MS scheme with g = E. (In the real world, we 
have a = g 2 / Air = 0.12 for g = M z = 91GeV.) Then we can think of 
each hadron as being made up of a loose collection of quarks and gluons, 
and these parts of a hadron, or partons, can be treated as independent 
participants in scattering processes. In order to extract quantitative results 
for hadron scattering (a project beyond the scope of this book), we need 
to know how each hadron’s energy and momentum is shared among its 
partons. This is described by parton distribution functions. At present, 
these cannot be calculated from first principles, but they have to satisfy 
a variety of consistency conditions that can be derived from perturbation 
theory, and that relate their values at different energies. These conditions 
are well satisfied by current experimental data. 

Reference Notes 

More detail on how hadron scattering experiments can be compared with 
parton scaterring amplitudes can be found in Peskin & Schroeder, Muta , 
Quigg, and Sterman. 


Problems 

81.1) Compute the four-gluon partial amplitude A( l _ ,2 + ,3 _ ,4 + ) directly 
from the Feynman diagrams, and verify eq. (81.37). 

81.2) Compute the qqgg partial amplitude A(l^, 2+, 3~ , 4 + ) with q 4 = p\ 
and (/3 = £14. Show that, with this choice of the reference momenta, 
the first line of eq. (81.47) vanishes. Evaluate the second line, and 
verifiy eq. (81.55). 

81.3) Compute A(l^, 2+, 3 _ , 4 + ), and verify eq. (81.56). 

81.4) a) Verify eqs. (81.58) and (81.59) using the double-line notation of 
section 80. 

b) Compute Tr (T“T|T|T“) and Tr (T“T|T“T|) in terms of the index 
T(R) and dimension D (R) of the representation R, and the index 
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T( A) and dimension D{ A) of the adjoint representation. Verify that 
your results reproduce eqs. (81.58) and (81.59). 

81.5) Compute the sum over helicities of eq. (81.57). Express your answer 
in terms of s, t , and u for the process qq —> gg. 

81.6) Consider the partial amplitude A(l^, 2+, 3 _ , 4 + , 5 + ). Show that, with 
the choice q% = &2 and q 4 = q$ = k \ , there are just two contributing 
diagrams. Evaluate them. After some manipulations, you should be 
able to put your result in the form 


A( ir,2+3-,4+,5+) 


(1 3) 3 (2 3) 

(1 2) (2 3) (3 4) (4 5) (5 1) ' 


(81.60) 
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82 Wilson Loops, Lattice Theory, and 

Confinement 

Prerequisite: 29, 73 

In this section, we will contruct a gauge-invariant operator, the Wilson loop, 
whose vacuum expectation value (VEV for short) can diagnose whether 
or not a gauge theory exhibits confinement. A theory is confining if all 
finite-energy states are invariant under a global gauge transformation. U(l) 
gauge theory — quantum electrodynamics — is not confining, because there 
are finite-energy states (such as the state of a single electron) that have 
nonzero electric charge, and hence change by a phase under a global gauge 
transformation. 

Confinement is a nonperturbative phenomenon; it cannot be seen at any 
finite order in the kind of weak-coupling perturbation theory that we have 
been doing. (This is why we had no trouble calculating quark and gluon 
scattering amplitudes.) In this section, we will introduce lattice gauge the- 
ory, in which spacetime is replaced by a discrete set of points; the inverse 
lattice spacing 1/a then acts as an ultraviolet cutoff (see section 29). This 
cutoff theory can be analyzed at strong coupling, and, as we will see, in 
this regime the VEV of the Wilson loop is indicative of confinement. The 
outstanding question is whether this phenomenon persists as we simul- 
taneously lower the coupling and increase the ultraviolet cutoff (with the 
relationship between the two governed by the beta function), or whether we 
encounter a phase transition, signalled by a sudden change in the behavior 
of the Wilson loop VEV. 

We take the gauge group to be SU(IV). Consider two spacetime points 
x ^ and where is infinitesimal. Define the Wilson link 

W(x+e,x) = exp [ige^A^x)} , (82.1) 

where A /t (x) is an N x N matrix- valued traceless hermitian gauge field. 
Since e is infinitesimal, we also have 

W (x+e, x) = I + ig^A^x) + 0{£ 2 ) . (82.2) 

Let us determine the behavior of the Wilson link under a gauge trans- 
formation. Using the gauge transformation of Affix) from section 69, we 
find 

W(x+£,x) — y 1 + ige^U(x)Ap(x)U\x) — e^U (x)d fl U Jf (x) , (82.3) 

where U (x) is a spacetime-dependent special unitary matrix. Since UU' = 
1, we have —Ud^U^ = -\-(d^,U)W; thus we can rewrite eq. (82.3) as 

W (x+e, x ) — ■> ^(1 + E^d^U (x)) U\x) + ig£^JJ (x)A fl (x)U\x) . (82.4) 
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In the first term, we can use (1 + e tl d lx )U(x) = U(x+£ ) + 0(e 2 ). In the 
second term, which already contains an explicit factor of we can replace 
U(x ) with U(x+e) at the cost of an 0(e 2 ) error. Then we get 

W{x+e,x) — > U(x+e)(l + igp 1 A^(x)^jU\x) , (82.5) 

which is equivalent to 

W(x+e,x) — > U(x+s)W(x+£, x)U\x) . (82.6) 

Note also that eq. (82.1) implies W'{x+£,x) = W(x—s,x). We can 
shift x to x + e at the cost of an 0(e 2 ) error, and so 

W\x+£,x) = W{x,x+e) , (82.7) 

which is consistent with eq. (82.6). 

Now consider mutiplying together a string of Wilson links, specified by 
a starting point x and n sequential infinitesimal displacement vectors £j. 
The ordered set of e’s defines a path P through spacetime that starts at x 
and ends at y = x + £\ + . . . + £ n . The Wilson line for this path is 

Wp(y, x) = W (y, y-£ n ) ... W (x+£i+£ 2 , x+£i)W {x+£\,x) . (82.8) 

Using eq. (82.6) and the unitarity of U(x), we see that, under a gauge 
transformation, the Wilson line transforms as 

W P (y,x) - U(y)W P (y, x)U^(x) . (82.9) 

Also, since hermitian conjugation reverses the order of the product in 
eq. (82.8), using eq. (82.7) yields 

W ] p (y y,x) = W- P (x, y) , (82.10) 

where — P denotes the reverse of the path P. 

Now consider a path that returns to its starting point, forming a closed, 
oriented curve C in spacetime. The Wilson loop is the trace of the Wilson 
line for this path, 

Wc = TrWc{x,x) . (82.11) 

Using eq. (82.9), we see that the Wilson loop is gauge invariant, 

Wc — > Wc ■ (82.12) 

Wl = W-c , 


Also, eq. (82.10) implies 


(82.13) 
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where — C denotes the curve C traversed in the opposite direction. 

To gain some intuition, we will calculate (0|Wc|0) for U(l) gauge theory, 
without charged fields. This is simply a free-held theory, and the calculation 
can be done exactly. 

In order to avoid dealing with ie issues, it is convenient to make a Wick 
rotation to euclidean spacetime (see section 29). The action is then 

5 = J d 4 x \F I1V F I1V , (82.14) 

where F^ = d^A u — d v A lt . The VEV of the Wilson loop is now given by 
the path integral 

(0|Wc|0> = J VAe i9 $c dx ^ e - s . (82.15) 


If we formally identify g § c rix' /t as a current (x ) , we can apply our results 
for the path integral from section 57. After including a factor of i from the 
Wick rotation, we get 


(0|lTc|0) = exp 


-\g 2 dx fj, j) dy v A ^(x-y) 


(82.16) 


where A ^ w (x—y) is the photon propagator in euclidean spacetime. In Feyn- 
man gauge, we have 


A fj, u (x-y) = 8, w J 


(2tt) 


= 5 , 


Ills 


47T 


d 4 k e ik < x -i /) 

P 

k 3 dk 
k 2 J 


d6 sm 2 0e ik \ x - y \ cose 


= < 5 , 


47t ^°° k 3 dk TrJi(k\x— y\) 

o 


(27 r) 4 J o k 2 k\x—y\ 


8 , 


[M/ 


47 t 2 (x — y ) 2 Jo 


du J\(u) 


J fll/ 


47 t 2 (x — y) 2 


(82.17) 


where J\ (u) is a Bessel function. Since A fi V {x—y) depends only on x—y, 
the double line integral in eq. (82.16) will yield a factor of the perimeter P 
of the curve C. There is also an ultraviolet divergence as x approaches y; 
we will cut this off at a length scale a. The result is then 


(0|1T C |0) = exp[ — (cy 2 / a)P ] , 


(82.18) 
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where c is a numerical constant that depends on the shape of C and the de- 
tails of the cutoff procedure. This behavior of the Wilson loop in euclidean 
spacetime — exponential decay with the length of the perimeter — is called 
the perimeter law. It is indicative of unconfined charges. 

We can gain more insight into the meaning of (0|Wc|0) by taking C 
to be a rectangle, with length T in the time direction and R in a space 
direction, where a <C R <C T. (Of course, in euclidean spacetime, the 
choice of the time direction is an arbitrary convention.) The reason for 
this particular shape is that the current g j> c dx ^ corresponds to a point 
charge moving along the curve C . When the particle is moving backwards 
in time, the associated minus sign is equivalent to a change in the sign of 
its charge. So when we compute (0|ITc|0), we are doing the path integral 
in the presence of a pair of point charges with opposite sign, separated by 
a distance R, that exists for a time T. On general principles (see section 
6), this path integral is proportional to exp(— 77 pa i r T), where impair is the 
ground-state energy of the quantum electrodynamic held in the presence of 
the charged particle pair. 

We now turn to the calculation. If both x and y are on the same side 
of the rectangle, we find 

[ L [ L dxdy = 2 L/a - 21n (L/a) + 0(1) , (82.19) 

Jo Jo ( X - yY 

where L is the length of the side (either R or T), and the 0(1) term is a 
numerical constant that depends on the details of the short-distance cutoff. 
If x and y are on perpendicular sides, the double line integral is zero, 
because then dx ■ dy = 0. If x is on one short side and y on the other, the 
integral evaluates to R 2 /T 2 , and this we can neglect. Finally, if x is on one 
long side and y is on the other, we have 

( x - yY + R 2 = 11 T/R ~ 2 ln(T/jR) ~ 2 + °( i?2 /^ 2 ) • (82.20) 

Adding up all these contributions, we find in the limit of large T that 

££ WW 2 = ( 4/ “ ” 2n/R ^ T + 0(1,1 T) ' (82 ' 21) 

Combining this with eqs. (82.16) and (82.17), and setting a = g 2 / 47r, we 
find 

<0|W C |0) =exp[-(— -^)t] . (82.22) 

\ira RJ 

Comparing this with the general expectation (0|ITc|0) oc exp(— .E pa i r T), 
we find a cutoff dependent contribution to impair that represents a divergent 
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2 



1 


Figure 82.1: The minimal Wilson loop on a hypercubic lattice goes around 
an elementary plaquette; this one lies in the 1-2 plane. 


self-energy for each point particle, plus the Coulomb potential energy for 
the pair, V (R) = —a/R. 

In the nonabelian case, where there are interactions among the gluons, 
we must expand everything in powers of g. Then we find 


<0|Wc|0> = Tr 


1 - \g 2 T a T a 


dx,, 


c 


c 


dyu (• x-y) + Q(g 4 ) 


. (82.23) 


Since T a T a equals the quadratic Casimir C(N) for the fundamental repre- 
sentation (times an identity matrix), we see that to leading order in g 2 we 
simply reproduce the results of the abelian case, but with g 2 — * g 2 C(N). 
We can also consider a Wilson loop in a different representation by setting 
A nix) = t4“(x)T“. Then, at leading order, we get a factor of C(R) instead 
of C(N). Perturbative corrections can be computed via standard Feynman 
diagrams with gluon lines that, in position space, have at least one end on 
the curve C. 

Next we turn to a strong coupling analysis. We begin by constructing 
a lattice action for nonabelian gauge theory. Consider a hypercubic lattice 
of points in four- dimensional euclidean spacetime, with a lattice spacing a 
between nearest-neighbor points. The smallest Wilson loop we can make 
on this lattice goes around an elementary square or plaquette, as shown in 
fig. (82.1). Let £\ and £2 be vectors of length a in the 1 and 2 directions, 
and let x be the point at the center of the plaquette. Using the center of 
each link as the argument of the gauge field, and using the lower-left corner 
as the starting point, we have (multiplying the Wilson links from right to 
left along the path) 


Rplaq = 


Tr e y i 9 a A2(x-£i/2) e -iga,Ai( x +E 2 / 2 ) e +iga,A 2 {x+Ei/ 2 ) e +igaAi(x-e 2 / 2 ) 

(82.24) 


If we now treat the gauge field as smooth and expand in a, we get 


Rplaq 


rjij. e -igaA2(x)+iga 2 diA2(x)/2+... e ~igaAi(x)-iga 2 c^Apa;) /2+... 


x e +igaA2{x)+iga 2 diA2(x)/2+... e +igaAi(x)-iga 2 d2Ai(x)/2+... 


(82.25) 
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Next we use e A e B = e A + B +[ A > B \/ 2 +— combine the two exponential factors 
on the first line of eq. (82.25), and also the two exponential factors on the 
second line. Then we use this formula once again to combine the two results. 
We get 

Wplaq = Tr e + i 9 a2 (9i A 2-d2 A i-ig[A 1 , A 2 ])+... ? (82.26) 

where all fields are evaluated at x. If we now take IFpiaq + IF_ p i aq and 
expand the exponentials, we find 


w^plaq + ^E-plaq — 2/V — g 2 a 4 TrF 2 2 + . . . , 


(82.27) 


where F\ 2 = d\A 2 —d 2 A\—ig[A\, A 2 ] is the Yang-Mills field strength. From 
eq. (82.27), we conclude that an appropriate action for Yang-Mills theory 
on a euclidean spacetime lattice is 

S = (82-28) 
plaq 

where the sum includes both orientations of all plaquettes. Each IF p i aq is 
expressed as the trace of the product of four special unitary NxN matrices, 
one for each oriented link in the plaquette. If U is the matrix associated 
with one orientation of a particular link, then U' is the matrix associated 
with the opposite orientation of that link. The path integral for this lattice 
gauge theory is 

Z = j VU e~ s , (82.29) 

where 

VU = dU link , (82.30) 

links 

and dU is the Haar measure for a special unitary matrix. The Haar measure 
is invariant under the transformation U — » VU, where V is a constant spe- 
cial unitary matrix, and is normalized via / dU = 1; this fixes it uniquely. 
For N > 3, it obeys 


dU Uij = 0 


dU UijUki — 0 , 
dU UijUki = jfSikSjt , 


(82.31) 

(82.32) 

(82.33) 


which is all we will need to know. 

Now consider a Wilson loop, expressed as the trace of the product of 
the C/’s associated with the oriented links that form a closed curve C. For 
simplicity, we take this curve to lie in a plane. We have 


(0|W C |0> = VUW c e 


-S 


(82.34) 
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We will evaluate eq. (82.34) in the strong coupling limit by expanding e~ s 
in powers of 1 / g 2 . At zeroth order, e~ s — > 1; then eq. (82.31) tells us that 
the integral over every link in C vanishes. To get a nonzero result, we need 
to have a corresponding U' from the expansion of e~ s . This can only come 
from a plaquette containing that link. But then the integral over the other 
links of this plaquette will vanish, unless there is a compensating U' for 
each of them. We conclude that a nonzero result for (0|Wc , |0) requires us 
to fill the interior of C with plaquettes from the expansion of e -5 . Since 
each plaquette is accompanied by a factor of 1/g 2 , we have 

<0|Wc|0) ~ (1 /g 2 ) A/a2 , (82.35) 

where A is the area of the surface bounded by C , and A/ a 2 is the number 
of plaquettes in this surface. Eq. (82.35) yields the area law for a Wilson 
loop, 

(0|W C |0) oc e~ rA , (82.36) 

where 

r = c(g)/a 2 (82.37) 

is the string tension. In the strong coupling limit, c(g) = ln(g 2 ) + 0(1). 

The area law for the Wilson loop implies confinement. To see why, 
let us again consider a rectangular loop with area A = RT. Comparing 
eq. (82.36) with the general expectation (0|H 7 c'|0) oc exp(— E pa i r T), we see 
that Ep a i r = V(R) = tR. This corresponds to a linear potential between 
nonabelian point charges in the fundamental representation. It takes an 
infinite amount of energy to separate these charges by an infinite distance; 
the charges are therefore confined. The coefficient r of R in V (R) is called 
the string tension because a linear potential is what we get from two points 
joined by a string with a fixed energy per unit length; the energy per unit 
length of a string is its tension. 

The string tension r is a physical quantity that should remain fixed 
as we remove the cutoff by lowering a. Thus lowering a requires us to 
lower g. The outstanding question is whether c(g) reaches zero at a finite, 
nonzero value of g. If so, at this point there is a phase transition to an 
unconfined phase with zero string tension. This has been proven to be the 
case for abelian gauge theory (which also exhibits an area law at strong 
coupling, by the identical argument). In nonabelian gauge theory, on the 
other hand, analytic and numerical evidence strongly suggests that c(g) 
remains nonzero for all nonzero values of g. 

At small a and small g, the behavior of g as a function of a is governed 
by the beta function, (3(g) = —adg/da. (The minus sign arises because the 
ultraviolet cutoff is a -1 .) Requiring r to be independent of a yields 

c'(g) = -\(3(g)c(g) . 


(82.38) 
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At small g, we have 

P{9) = -M 3 + 0(g 5 ) , (82.39) 

where b\ = 11A/487T 2 for SU(1V) gauge theory without quarks. Solving for 
c(g) yields 

c(g) = Cexp{l/big 2 ) , (82.40) 

where C is an integration constant, which is nonzero if there is no phase 
transition. In this case, at small g, the string tension has the form 

r = C exp(l/big 2 )/a 2 . (82.41) 

We then want to take the continuum limit of a — > 0 and g — > 0 with r held 
fixed. 

Note that eq. (82.41) shows that the string tension, at weak coupling, 
is not analytic in g , and so cannot be computed via the Taylor expansion 
in g that is provided by conventional weak-coupling perturbation theory. 
Instead, the path integrals of eqs. (82.29) and (82.34) can be performed on 
a finite-size lattice via numerical integration. The limiting factor in such a 
calculation is computer resources. 

Reference Notes 

An introduction to lattice theory is given in Smit. 

Problems 

82.1) Let C be a circle of radius R. Evaluate the constant c in eq. (82.18), 
where P = 2irR is the circumference of the circle. Replace l/(x— y) 2 
with zero when \x—y\ < a. Assume a <C R- 
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83 Chiral Symmetry Breaking 

Prerequisite: 76, 82 

In the previous section, we discussed confinement in Yang-Mills theory 
without quarks. In the real world, there are six different flavors of quark; 
see Table 1. Each flavor has a different mass, and is represented by a Dirac 
field in the fundamental or 3 representation of the color group SU(3). Such 
a Dirac field is equivalent to two left-handed Weyl fields, one in the funda- 
mental representation, and one in the antifundamental or 3 representation. 

The lightest quarks are the up and down quarks, with masses of a few 
MeV. These masses are small, in the following sense. The gauge coupling 
g of QCD becomes large at low energies. If we truncate the beta function 
after some number of terms (in practice, four or fewer), and integrate it, 
we find that g becomes infinite at some finite, nonzero value of the MS 
parameter /r; this value is called A QCD . Measurements of the strength of 
the gauge coupling at high energies imply A QCD ~ 0.2 GeV. The up and 
down quark masses are much less than A QCD . We can therefore begin with 
the approximation that the up and down quarks are massless. The mass of 
the strange quark is also somewhat less than A QCD . It is sometimes useful 
(though clearly less justified) to treat the strange quark as massless as well. 

If we are interested in hadron physics at energies below ~1 GeV, we can 
ignore the charm, bottom, and top quarks entirely; we will also ignore the 
strange quark for now. Let us, then, consider QCD with n F = 2 flavors 
of massless quarks. We then have left-handed Weyl fields Xai, where a = 
1, 2, 3 is a color index for the 3 representation, and i = 1, 2 is a flavor index, 
and left-handed Weyl fields £“*, where a = 1,2,3 is a color index for the 
3 representation, and i = 1,2 is a flavor index; we distinguish this flavor 
index from the one for the %’s by putting a bar over it, and we write it as 
a superscript for later notational convenience. We suppress the undotted 
spinor index carried by both x and £. The lagrangian is 

c = ix xpi + 4* ^(4)4 4* - i F a ^FZ u , (83.1) 

where D^ = d^- igT^A* and = d,, - igT?A“, with {T^) a fi = -(Tg)p a . 
In addition to the SU(3) color gauge symmetry, this lagrangian has a global 
U(2) x U(2) flavor symmetry: C is invariant under 

Xai ->• LflXaj , (83.2) 

, (83.3) 

where L and R* are independent 2x2 constant unitary matrices. (The 
complex conjugation of R is a notational convention that turns out to be 
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name 

symbol 

mass 

(GeV) 

Q 

up 

u 

0.0017 

+2/3 

down 

d 

0.0039 

-1/3 

strange 

s 

0.076 

-1/3 

charm 

c 

1.3 

+2/3 

bottom 

b 

4.3 

-1/3 

top 

t 

178 

+2/3 


Table 1: The six flavors of quark. Each flavor is represented by a Dirac 
field in the 3 representation of the color group SU(3). Q is the electric 
charge in units of the proton charge. Masses are approximate, and are MS 
parameters. For the it, d, and s quarks, the MS scale g is taken to be 2 GeV. 
For the c, b, and t quarks, g is taken to be equal to the corresponding mass; 
e.g., the bottom quark mass is 4.3 GeV when /i = 4.3 GeV. 

convenient.) In terms of the Dirac field 



eqs. (83.2) and (83.3) read 

P^ai -»• , (83.5) 

Pr** -»• RPP^aj , (83.6) 

where P L R = =F 7s)- Thus the global flavor symmetry is often called 
U(2 )l x U(2 )r. A symmetry that treats the left- and right-handed parts of 
a Dirac field differently is said to be chiral. 

However, there is an anomaly in the axial U(l) symmetry corresponding 
to L = R* = e ia I (which is equivalent to \k — ■> e _ * a7s T for the Dirac field) . 
Thus the nonanomalous global flavor symmetry is SU(2 )l x SU(2)rxU(1)v, 
where V stands for vector. The U(l)y transformation corresponds to 
L = R = e~ ia I, or equivalently 'k — > e _ * a T. The corresponding con- 
served charge is quark number , the number of quarks minus the number of 
antiquarks; this is one third of the baryon number, the number of baryons 
minus the number of antibaryons. ( Baryons are color-singlet bound states 
of three quarks; the proton and neuton are baryons. Mesons are color- 
singlet bound states of a quark and an antiquark; pions are mesons.) 

Thus, U(l)y results in classification of hadrons by their baryon num- 
ber. How is the SU(2 )l x SU(2)r symmetry realized in nature? The vector 
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subgroup SU(2)v, obtained by setting R = L in eq. (83.3), is known as iso- 
topic spin or isospin symmetry. Hadrons clearly come in representations 
of SU(2)y: the lightest spin-one-half hadrons (the proton, mass 0.938 GeV, 
and the neutron, mass 0.940 GeV) form a doublet or 2 representation, while 
the lightest spin-zero hadrons (the 7r°, mass 0.135 GeV, and the mass 
0.140 GeV) form a triplet or 3 representation. Isospin is not an exact sym- 
metry; it is violated by the small mass difference between the up and down 
quarks, and by electromagnetism. Thus we see small differences in the 
masses of the hadrons assigned to a particular isotopic multiplet. 

The role of the axial part of the SU(2 )l x SU(2)r symmetry, obtained 
by setting R = in eq. (83.3), is harder to identify. The hadrons do not 
appear to be classified into multiplets by a second SU(2) symmetry group. 
In particular, there is no evidence for a classification that distinguishes the 
left- and right-handed components of spin-one-half hadrons like the proton 
and neutron. 

Reconciliation of these observations with the SU (2 )l x SU (2)r symmetry 
of the underlying lagrangian is only possible if the axial generators are 
spontaneously broken. The three pions (which have spin zero, odd parity, 
and are by far the lightest hadrons) are then identified as the corresponding 
Goldstone bosons. They are not exactly massless (and hence are sometimes 
called pseudogoldstone bosons) because the SU(2 )l x SU(2)r symmetry is, 
as we just discussed, not exact. 

To spontaneously break the axial part of the SU(2 )l x SU(2)r, some op- 
erator that transforms nontrivially under it must acquire a nonzero vacuum 
expectation value, or VEV for short. To avoid spontaneous breakdown of 
Lorentz invariance, this operator must be a Lorentz scalar, and to avoid 
spontaneous breakdown of the SU(3) gauge symmetry, it must be a color 
singlet. Since we have no fundamental scalar fields that could acquire a 
nonzero VEV, we must turn to composite fields instead. The simplest can- 
didate is x a aiiV = ^ aj ai, where a is an undotted spinor index. (The 
product of two fields is generically singular, and a renormalization scheme 
must be specified to define it.) We assume that 

<0|x£iCT> = -v 3 6S , (83.7) 

where v is a parameter with dimensions of mass. Its numerical value 
depends on the renormalization scheme; for MS with /j, = 2 GeV, v — 
0.23 GeV. 

To see that this fermion condensate does the job of breaking the axial 
generators of SU(2 )l x SU(2)r while preserving the vector generators, we 
note that, under the transformation of eqs. (83.2) and (83.3), 

<oixs ( e J io) - ii‘(R*) J «(oix^?rio) 
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-»• -v 3 (LR f )/ , (83.8) 

where we used eq. (83.7) to get the second line. If we take R = L, cor- 
responding to an SU(2)y transformation, the right-hand side of eq. (83.8) 
is unchanged from its value in eq. (83.7). This signifies that SU(2)y [and 
also U(l)v] is unbroken. However, for a more general transformation with 
R^L, the right-hand side of eq. (83.8) does not match that of eq. (83.7), 
signifying the spontaneous breakdown of the axial generators. 

Eq. (83.7) is nonperturbative : (0|Xaj£a^|0) vanishes at tree level. Pertur- 
bative corrections then also vanish, because of the chiral flavor symmetry 
of the lagrangian. Thus the value of v is not accessible in perturbation the- 
ory. On general grounds, we expect v ~ A QCD , since A QCD is the only mass 
scale in the theory when the quarks are massless. Similarly, A QCD sets the 
scale for the masses of all the hadrons that are not pseudogoldstone bosons, 
including the proton and neutron. 

We can construct a low-energy effective lagrangian for the three pseu- 
dogoldstone bosons (to be identified as the pions) in the following way. We 
allow the orientation in flavor space of the VEV of Xai^a^ to vary slowly as 
a function of spacetime. That is, in place of eq. (83.7), we write 

( 0 lxSi(*)Co J (*)l 0 ) = - v 3 Ui>{x ) , (83.9) 

where U(x) is a spacetime dependent unitary matrix. We can write it as 

U(x) = exp[2i7r a (x)T“// 7r ] , (83.10) 

where T a = ^a a with a = 1,2,3 are the generator matrices of SU(2), 
7 r a (x) are three real scalar fields to be identified with the pions, and f n is 
a parameter with dimensions of mass, the pion decay constant. We do not 
include a fourth generator matrix proportional to the identity, since the 
corresponding held would be the Goldstone boson for the U(1)a symmetry 
that is eliminated by the anomaly. Equivalently, we require det U (x) = 1 . 

We will think of U(x) as an effective, low energy held. Its lagrangian 
should be the most general one that is consistent with the underlying 
SU(2 )l x SU(2 )r symmetry. 1 Under a general SU(2 )l x SU(2 )r trans- 
formation, we have 

U(x) -»• LU{x)R ] , (83.11) 

where L and R are independent special unitary matrices. We can organize 
the terms in the effective lagrangian for U(x) (also known as the chiral 
lagrangian ) by the number of derivatives they contain. Because U'U = 1, 


1 U(l)v acts trivially on U(x), and so we need not be concerned with it. 
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there are no terms with no derivatives. There is one term with two (all 
others being equivalent after integrations by parts), 

jC = -^Trd^U%U. (83.12) 

If we substitute in eq. (83.10) for U, and expand in inverse powers of f n , 
we find 

C = -i^7r°^7r° + lf- 2 (ir a ir a d^TT b d^TT b - ttVoV^tt 0 ) + . . . . (83.13) 

Thus the pion fields are conventionally normalized, and they have interac- 
tions that are dictated by the general form of eq. (83.12). These interactions 
lead to Feynman vertices that contain factors of momenta p divided by f n . 
Therefore, we can think of p/ f n as an expansion parameter. Of course, 
we should also add to C all possible inequivalent terms with four or more 
derivatives, with coefficients that include inverse powers of f n . These will 
lead to more vertices, but their effects will be suppressed by additional 
powers of p/ f n . Comparison with experiment then yields f n = 92.4 MeV. 
(In practice, the value of f n is more readily determined from the decay rate 
of the pion via the weak interaction; see section 90 and problem 48.5.) 

This value for f n may seem low; it is, for example, less than the mass 
of the “almost masless” pions. However, it turns out that tree and loop 
diagrams contribute roughly equally to any particular process if each extra 
derivative in C is accompanied by a factor of (Air/jr )" 1 rather that f~ x , and 
each loop momentum is cut off at 4 irf n . Thus it is 4 irf n ~ 1 GeV that sets 
the scale of the interactions, rather than ~ 100 MeV. 

Now let us consider the effect of including the small masses for the up 
and down quarks. The most general mass term we can add to the lagrangian 
is 


Anass = -C J MjXai + h.C. 


= + c. 


= -Tr M X aC + h.c. , (83.14) 


where M is a complex 2x2 matrix. By making an SU(2 )l x SU(2)r 
transformation, we can bring M to the form 


M = 




(83.15) 


where m u and m c j are real and positive. We cannot remove the overall 
phase 6, however, without making a forbidden U(1)a transformation. A 
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nonzero value of 6 has physical consequences, as we will discuss in section 
94. For now, we note that experimental observations fix \9\ < 10~ 9 , and so 
we will set 0 = 0 on this phenomenological basis. 

Next, we replace Xa£ a i n eq. (83.14) with its spacetime dependent YEV, 
eq. (83.7). The result is a term in the chiral lagrangian that incorporates 
the leading effect of the quark masses, 

-Cmass = v 3 Tr (MU + M* C/t) . (83.16) 

Here we continue to distinguish M and M', even though, with 0 = 0, they 
are the same matrix. If we think of M as transforming as M — > RML\ 
while U transforms as U — ► LUR', then Tr MU is formally invariant. We 
then require all terms in the chiral lagrangian to exhibit this formal invari- 
ance. 

If we expand £ mass in inverse powers of f 7 r , and use = M, we find 

Anass = -4 (u 3 //2) TV + . . . 

= -2{v 3 /fl) Tr (M{T a , T b })Tr a ir b + ... 

= -(v 3 / fl){Tr M)n a n a + . . . . (83.17) 

We used the SU(2) relation {T a ,T b } = ^5 ab to get the last line. From 
eq. (83.17), we see that all three pions have the same mass, given by the 
G ell- Mann- Oakes-Renner relation , 


ml = 2 (m u + in d )v 3 /f^ . (83.18) 


On the right-hand side, the quark masses and v 3 depend on the renormaliza- 
tion scheme, but their product does not. In the real world, electromagnetic 
interactions raise the mass of the slightly above that of the 7r°. 

This framework is easily expanded to include the strange quark. The 
three pions (7r + , n~ , mass 0.140 GeV; 7r°, mass 0.135 GeV), the four kaons 
(. K + , K~ , mass 0.494 GeV; K°, K°, mass 0.498 GeV), and the eta (?/, mass 
0.548 GeV) are identified as the eight expected Goldstone bosons. We can 
assemble them into the hermitian matrix 


n = rr a T a /U 


1 

2A 


/ 7T° + -4^7/ \/2i r H 

\/27r _ — 7T° + 

V V2K- 


V2 K° 


V2K+ \ 
V2. K° 


■fa/ 


(83.19) 


The second line of eq. (83.17) still applies, but now the T a 's are the genera- 
tors of SU(3), and M includes a third diagonal entry for the strange quark 
mass. We leave the details to the problems. 
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Next we turn to the coupling of the pions to the nucleons (the proton 
and neutron). We define a Dirac field JVj, where N\ = p (the proton) 
and N 2 = n (the neutron). We assume that, under an SU(2 )l x SU(2)r 
transformation, 

PrNi -» LPp h Nj , (83.20) 

PrN, -> R,Pp it Nj . (83.21) 

The standard Dirac kinetic term iNlpN is then SU(2 )l x SU(2)r invariant, 

but the standard mass term rn^NN is not. (Here mjv is the value of the 
nucleon mass in the limit of zero up and down quark masses.) However, 
we can construct an invariant mass term by including appropriate factors 
of U and U\ 

-Cmass = -m N N(U^P R + UP R )N . (83.22) 

There is one other parity, time-reversal, and SU(2 )l x SU(2)r invariant 
term with one derivative. Including this term, we have 

C = iN<j)N - m N N(U ] P R + UP r )N 

- \{g A -l)iN^{Ud^P R + U^d^UP R )N , (83.23) 

where g A = 1.27 is the axial vector coupling. Its value is determined from 
the decay rate of the neutron via the weak interaction; see section 90. 

The form of the lagrangian in eq. (83.23) is somewhat awkward. It can 


be simplified by first defining 

u(x) = ex.p[iir a (x)T a / , (83.24) 

so that U ( x ) = u 2 (x) . Then we define a new nucleon field 

J\f = (u^ P L + uP r )N . (83.25) 

(This is a field, redefinition in the sense of problem 11.5.) Equivalently, 
using the unitarity of u, we have 

N = ( uP L + u^P n )fif . (83.26) 

Using eq. (83.26) in eq. (83.23), along with the identities d^U = (d^u)u + 
u(d^u), (djpfiju = —v)(d^u), etc., we ultimately find 

C = iJ\f<j)Af — ijinMN + — g A N/iyz,N , (83.27) 

where we have defined the hermitian vector fields 

+ u(d^)\ , (83.28) 

= \i{u\d,j,u) - ufid^)] . (83.29) 
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If we now expand u and u' in inverse powers of f w , we get 

C = iJf^M - m N m + (WA)^7T‘WTV75Af + . . . . (83.30) 

We can integrate by parts in the interaction term to put the derivative on 
the U and M fields. Then, if we consider a process where an off-shell pion 
is scattered by an on-shell nucleon, we can use the Dirac equation to replace 
the derivatives of l\f and M with factors of m^- We then find a coupling 
of the pion to an on-shell nucleon of the form 

^ttNN = ~^9irNN^ A^< 75 A? , (83.31) 

where we have set T a = \cr a , and identified the pion-nucleon coupling 
constant, 

9-kNN = 9A.m N /U • (83.32) 

The value of g_ can be determined from measurements of the neutron- 
proton scattering cross section, assuming that it is dominated by pion ex- 
change; the result is g n j^ N = 13.5. Eq. (83.32), known as the Goldberger - 
Treiman relation, is then satisfied to within about 5%. 

Reference Notes 

The chiral lagrangian is treated in Georgi, Ramond II, and Weinberg II. 
Light quark masses are taken from MILC. 

Problems 

83.1) Suppose that the color group is SO(3) rather than SU(3), and that 
each quark flavor is represented by a Dirac field in the 3 representation 
of SO(3). 

a) With n F flavors of massless quarks, what is the nonanomalous 
flavor symmetry group? 

b) Assume the formation of a color-singlet, Lorentz scalar, fermion 
condensate. Assume that it preserves the largest possible unbroken 
subgroup of the flavor symmetry. What is this unbroken subgroup? 

c) For the case n F = 2, how many massless Goldstone bosons are 
there? 

d) Now suppose that the color group is SU(2) rather than SU(3), 
and that each quark flavor is represented by a Dirac field in the 2 
representation of SU(2). Repeat parts (a), (b), and (c) for this case. 
Hint: at least one of the answers is different! 
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83.2) Why is there a minus sign on the right-hand side of eq. (83.7)? 

83.3) Verify that eq. (83.13) follows from eq. (83.12). 

83.4) Use eqs. (83.12) and (83.16) to compute the tree-level contribution to 
the scattering amplitude for TT a ir b — ► 7r c ir d . Work in the isospin limit, 
m u = md = m. Express your answer in terms of the Mandelstam 
variables and the pion mass m v . 

83.5) Verify that eq. (83.27) follows from eqs. (83.26) and eq. (83.23). 

83.6) Consider the case of three light quark flavors, with masses m u , md , 
and m s . 

a) Find the masses-squared of the eight pseudogoldstone bosons. Take 
the limit m U} d m s , and drop terms that are of order d / m s- 

b) Assume that m^± and m 2 K ± each receive an electromagnetic con- 
tribution; to zeroth order in the quark masses, this contribution is the 
same for both, but the comparatively large strange quark mass results 
in an electromagnetic contribution to m 2 K ± that is roughly twice as 
large as the electromagnetic contribution A m| M to m 2 ±. Use the 
observed masses of the 7r ± , 7 r°, K±, and K° to compute m u v 3 / f 2 , 
m d v 3 /f 2 , m s v 3 /f 2 , and A m| M . 

c) Compute the quark mass ratios m u /md and m s /md- 

d) Use your results from part (b) to predict the rj mass. How good is 
your prediction? 

83.7) Suppose that the U(1 )a symmetry is not anomalous, so that we must 
include a ninth Goldstone boson. We can write 

U(x) = exp[2i7r a (x)T a // vr + ?'7 t 9 (x)//9] . (83.33) 

The ninth Goldstone boson is given its own decay constant fg , since 
there is no symmetry that forces it to be equal to f n . We write the 
two-derivative terms in the lagrangian as 

C = -\fl TV - \F 2 d ^{ det t/ t )9 /i ( det U) . (83.34) 

a) By requiring all nine Goldstone fields to have canonical kinetic 
terms, determine F in terms of f n and fg. 

b) To simplify the analysis, let m u = md = m <C m s . Find the masses 

of the nine pseudogoldstone bosons. Identify the three lightest as the 
pions, and call their mass m,. Show that another one of the nine 
has a mass less than or equal to (The nonexistence of such a 
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particle in nature is the U(l) problem ; the axial anomaly solves this 
problem.) 

83.8) a) Write down all possible parity and time-reversal invariant terms 
with no derviatives that are bilinear in the nucleon field N and that 
have one factor of the quark mass matrix M. 

b) Reexpress your result in terms of the nucleon field M . 

c) Use the observed neutron-proton mass difference, m n — m p = 
1.293 MeV, and the m u /m f j ratio you found in problem 83.6, to de- 
temine as much as you can about the coefficients of the terms wrote 
down. (Ignore the mass difference due to electromagnetism.) 
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84 Spontaneous Breaking of Gauge 

Symmetries 

Prerequisite: 32, 70 

Consider scalar electrodynamics, specified by the lagrangian 

C = -(D^D^ - V(<p) - \F^F, U , (84.1) 

where <p is a complex scalar field, D ^ = 8^ — igA and 

U(<£>) = m 2 <^V + i^(^V) 2 • (84.2) 

(We call the gauge coupling constant g rather than e because we are using 
this theory as a formal example rather than a physical model.) So far we 
have always taken m 2 > 0, but now let us consider m 2 < 0. We analyzed 
this model in the absence of the gauge field in section 32. Classically, the 
field has a nonzero vacuum expectation value (VEV for short), given by 

<0|^(*)|0> = , (84.3) 

where we have made a global U(l) transformation to set the phase of the 
VEV to zero, and 

v = (4|m 2 |/A) 1,/2 . (84.4) 

We therefore write 

<P( X ) = + p{x))e~ lx{x),v , (84.5) 

where p(x) and x( x ) are rea l scalar fields. The scalar potential depends 
only on p, and is given by 

V(<p) = \\v 2 p 2 + | A vp 3 + jq\ p A ■ (84.6) 

Since x does not appear in the potential, it is massless; it is the Goldstone 
boson for the spontaneously broken U(l) symmetry. 

The big difference in the gauge theory is that we can make a gauge 
transformation that shifts the phase of ip(x) by an arbitrary spacetime 
function. We can use this gauge freedom to set x( x ) = 0; this choice is 
called unitary gauge. Using eq. (84.5) with x{ x ) = 0 in eq. (84.1), we have 

= -i(<9 M p + ig{y + p)A^ l )(d ll p - ig(v + p)A fl ) 

= -Xd^pd^p - \g 2 {y + p) 2 A ^ . (84.7) 

Expanding out the last term, we see that the gauge field now has a mass 

M = gv . (84.8) 
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This is the Higgs mechanism : the Goldstone boson disappears, and the 
gauge field acquires a mass. Note that this leaves the counting of particle 
spin states unchanged: a massless spin-one particle has two spin states, 
but a massive one has three. The Goldstone boson has become the third 
or longitudinal state of the now-massive gauge field. A scalar field whose 
VEV breaks a gauge symmetry is generically called a Higgs field. 

This generalizes in a straightforward way to a nonabelian gauge theory. 
Consider a complex scalar field p in a representation R of the gauge group. 
The kinetic term for p is —(D^pyD^p, where the covariant derivative is 
{D^ip)i = d tl p % — ig a A“ ( ) fi pj , and the indices i and j run from 1 to d( R). 
We assume that p acquires a VEV 

(0|<Pi(aO|0> = -j^Vi , (84.9) 

where the value of Vj is determined (up to a global gauge transformation) 
by minimizing the potential. If we replace p by its VEV in — (D^p^D^p, 
we find a mass term for the gauge fields, 

£mass = 4(M 2 )“ 6 A“^ , (84.10) 

where the mass-squared matrix is 

(M 2 ) ab = \g 2 v*{T^Ti} ijVj . (84.11) 

The anticommutator appears because A afi A b L is symmetric on a <-* b, and 
so we replaced T“T* with ^{T“, T b }. 

If the field p is real rather than complex (which is possible only if R 
is a real representation), then we remove the factor of root-two from the 
right-hand side of eq. (84.9), but this is compensated by an extra factor of 
one-half from the kinetic term for a real scalar field; thus eq. (84.11) holds 
as written. If there is more than one gauge group, then the g 2 in eq. (84.11) 
is replaced by g a 9bi where g a is the coupling constant that goes along with 
the generator T a , and all generators of all gauge groups are included in the 
mass-squared matrix. 

Recall from section 32 that a generator T a is spontaneously broken if 
(Tfi)ij v ] 0. From eq. (84.11), we see that gauge fields corresponding 
to broken generators get a mass, while those corresponding to unbroken 
generators do not. The unbroken generators (if any) form a gauge group 
with massless gauge fields. The massive gauge fields (and all other fields) 
form representations of this unbroken group. 

Let us work out some simple examples. 

Consider the gauge group SU(IV), with a complex scalar field p in the 
fundamental representation. We can make a global SU(1V) transforma- 
tion to bring the VEV entirely into the last component, and furthermore 
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make it real. Any generator ( T a )i J that does not have a nonzero entry in 
the last column will remain unbroken. These generators form an unbro- 
ken SU(1V — 1) gauge group. There are three classes of broken generators: 
those with ( T a )i N = \ for i ^ N (there are N — 1 of these); those with 
( T a )i N = —\i for i ^ N (there are also N — 1 of these), and finally the sin- 
gle generator T jV “ _1 = [2N(N— l)]^ 1 / 2 diag(l, . . . , 1, — (N— 1)). The gauge 
fields corresponding to the generators in the first two classes get a mass 
M = \gv, we can group them into a complex vector field that transforms in 
the fundamental representation of the unbroken SU(1V — 1) subgroup. The 
gauge field corresponding to T N ~ 4 gets a mass M = [(N— 1)/2N] 1,/2 gv; it 
is a singlet of SU(AT— 1). 

Consider the gauge group SO(IV), with a real scalar field in the fun- 
damental representation. We can make a global SO(N ) transformation to 
bring the VEV entirely into the last component. Any generator ( T a )i 3 that 
does not have a nonzero entry in the last column will remain unbroken. 
These generators form an unbroken SO(V— 1) subgroup. There are N — 1 
broken generators, those with (T a )j A = —i for i ^ N . The corresponding 
gauge fields get a mass M = gv, they form a fundamental representation 
of the unbroken SO(N-l) subgroup. In the case N = 3, this subgroup is 
SO(2), which is equivalent to U(l). 

Consider the gauge group SU(IV), with a real scalar field 4> a in the 
adjoint representation. It will prove more convenient to work with the 
matrix- valued field = ip a T a ; the covariant derivative of 4> is = 

— igA^[T a , 4>], and the VEV of ip is a traceless hermitian N x N 
matrix V. Thus the mass-squared matrix for the gauge fields is ( M 2 ) ab = 
— \g 2 Tr{[T a , V], [T b , V]}. We can make a global SU(IV) transformation to 
bring V into diagonal form. Suppose the diagonal entries consist of N\ rq’s, 
followed by N 2 V 2 , s, etc., where v\ < V 2 < ■ ■ ■ , and Yli AjU* = 0. Then all 
generators whose nonzero entries lie entirely within the i th block commute 
with V, and hence form an unbroken SU(IVj) subgroup. Furthermore, the 
linear combination of diagonal generators that is proportional to V also 
commutes with V , and forms a U(l) subgroup. Thus the unbroken gauge 
group is SU(Vi) x SU^A^) x . . . x U(l). The gauge coupling constants for 
the different groups are all the same, and equal to the original SU(IV) gauge 
coupling constant. 

As a specific example, consider the case of SU(5), which has 24 gener- 
ators. Let the diagonal entries of V be given by (— — |, — +|, +\)v. 

The unbroken subgroup is then SU(3) x SU(2) x U(l). The number of 
broken generators is 24 — 8 — 3 — 1 = 12. The generator of the U(l) sub- 
group is T 24 = diag(— |c, — ^c, — ^c, +|c, +^c), where c 2 = 3/5. Under the 
unbroken SU(3) x SU(2) x U(l) subgroup, the 5 representation of SU(5) 
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transforms as 

5-(3,l,-±)©(l,2,+±). (84.12) 

Here the last entry is the value of T 24 /c. The 5 of SU(5) then transforms 

as 

5-(3,l,+±)©(l,2,-±). (84.13) 

To find out how the adjoint or 24 representation of SU(5) transforms under 
the SU(3) x SU(2) x U(l) subgroup, we use the SU(5) relation 

5(g)5 = 24© 1. (84.14) 

From eqs. (84.12) and (84.13), we have 

5®5^ [(3,1,— i)©(l,2,+i)]® [(3,l,+i)©(l,2,-i)] . (84.15) 

If we expand this out, and compare with eq. (84.14), we see that 

24 — > (8, 1,0) ©(1,3, 0)0 (1,1,0) 

© (3, 2, — |) © (3, 2, +|) . (84.16) 

The first line on the right-hand side of eq. (84.16) is the adjoint repre- 
sentation of SU(3) x SU(2) x U(l); the corresponding gauge fields remain 
massless. The second line shows us that the gauge fields corresponding to 
the twelve broken generators can be grouped into a complex vector field 
in the representation (3, 2, — |). Since it is an irreducible representation of 
the unbroken subgroup, all twelve vectors fields must have the same mass. 
This mass is most easily computed from (Af 2 ) 44 = — g 2 Tr([T 4 , V] [T 4 , P]), 
where we have defined (T 4 )^ = 4 + 5 t 4 S : ' the result is M = -yj^gv. 

Problems 


84.1) Conside a theory with gauge group SU(iV), with a real scalar field <J> 
in the adjoint representation, and potential 

P(4>) = im 2 Tr <J> 2 + |Ai Tr $ 4 + ±A 2 (Tr $ 2 ) 2 . (84.17) 


This is the most general potential consistent with SU(iV) symmetry 
and a Z 2 symmetry 4> — 4>, which we impose to keep things sim- 

ple. We assume to 2 < 0. We can work in a basis in which $ = 
v diag(«i, , ajv), with the constraints J2i a i = 0 and Yli a i = 1- 
a) Extremize V (4*) with respect to v. Solve for v, and plug your result 
back into V^T). You should find 


P(4>) 


4(m 2 ) 2 

X\A(a) + A 2 -S(oi) 


(84.18) 
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where A(a) and B(a) are functions of on. 

b) Show that AiH(a) + \ 2 B(a) must be everywhere positive in order 
for the potential to be bounded below. 

c) Show that the absolute mimimum of the potential (assuming that 
it is bounded below) occurs at the absolute minimum of AiH(a) + 
A 2 B(a). 

d) Show that, at any extremum of the potential, the on take on at 
most three different values, and that these three values sum to zero. 
Hint: impose the constraints with Lagrange multipliers. 

e) Show that, for Ai > 0 and A 2 > 0, at the absolute minimum of 
H(<L) the unbroken symmetry group is SU(-/V+) x SU(1V_) x U(l), 
where N + = 1V_ = | N if N is even, and N± = |(A r ±l) if N is odd. 
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85 Spontaneously Broken Abelian Gauge 

Theory 

Prerequisite: 61, 84 

Consider scalar electrodynamics, specified by the lagrangian 


C = - V{<p) - \F^F, lv , (85.1) 

where p is a complex scalar field and D M = d lt — igA M . We choose 

V(<p) = iM^V- h y2 ) 2 , (85.2) 

which yields a nonzero VEV for ip. We therefore write 

p{ x ) = ^{v + p{ x ))e- i ^/\ (85.3) 

where p(x) and x( x ) are real scalar fields. The scalar potential depends 
only on p, and is given by 

V(tp) = 3 A v 2 p 2 + A A vp i + ^Ap 4 . (85.4) 

We can now make a gauge transformation to set 

X (x) = 0 . (85.5) 


This is unitary gauge. The kinetic term for <p becomes 

-(D^ip^D^ip = - \d^ l pd^p - \g 2 {y + p) 2 AG4 M . (85.6) 

We see that the gauge field has acquired a mass 

M = gv . (85.7) 

The terms in C that are quadratic in A ^ are 

£ 0 = -i - \M 2 A^ . (85.8) 

The equation of motion that follows from eq. (85.8) is 

[(- d 2 + M 2 )g ^ + 8^d u ]A u = 0 . (85.9) 

If we act with d on this equation, we get 

M 2 d u A u = 0 . (85.10) 
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If we now use eq. (85.10) in eq. (85.9), we find that each component of A u 
obeys the Klein-Gordon equation, 

{-d 2 + M 2 )A u = 0 . (85.11) 


The general solution of eqs. (85.9) and (85.10) is 


A^(x) 


E 

A=— ,0,+ 


dk 




(k)a\{k)e 


• kl +4(k)a\tk)e 


—ikx 


(85.12) 


where the polarization vectors must satisfy k^e^k) = 0. In the rest frame, 
where k = (Af, 0,0,0), we choose the polarization vectors to correspond to 
definite spin along the z axis, 

M°) = 75(0,1, -i,o) , 

e_(0) = ^(0,1, -H,0) , 

eo(0) = (0,0, 0,1) . (85.13) 


More generally, the three polarization vectors along with the timelike unit 
vector kJ 1 /M form an orthonormal and complete set, 

k-e^(k) = 0 , (85.14) 

£ \'( k )- £ \(k) = 5y\ , (85.15) 

£ (85.16) 

A=— ,0,+ 

Since the lagrangian of eq. (85.8) has no manifest gauge invariance, 
quantization is straightforward. The coefficients a\(k) and a\(k) become 
particle creation and annihilation operators in the usual way, and the prop- 
agator of the A n field is given by 


i(0\TA^x)A"(y)\0) 


d 4 k e ik ( x ~y) 
(27r) 4 k 2 + M 2 — ie 


e?{k)eW) 


d 4 k e iH x ~y) / w k»k u \ 
(27 r) 4 k 2 + Af 2 - ie \ 9 + M 2 ) ' 


(85.17) 


The interactions of the massive vector field A /t with the real scalar field p 
can be read off of eq. (85.6). The self-interactions of the p field can be read 
off of eq. (85.4). The resulting Feynman rules can be used for tree-level 
calculations. 

Loop calculations are more subtle. We have imposed the gauge condi- 
tion x( x ) = 0, which corresponds to inserting a functional delta function 
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IMM*)) into the path integral. In order to integrate over we must 
make a change of integration variables from Re p and Im 99 to p and x\ this 
is simply a transformation from cartesian to polar coordinates, analogous 
to dx dy = r dr d(f>- So we must include a factor analogous to r in the 
functional measure; this factor is 

H(u + p(x)) = det(u + p) 

X 

oc det(l + v~ 1 p) 

oc J VcVc e ~ im ^f d4x 5(1+w ' lp)c . (85.18) 

In the last line, we have written the functional determinant as an integral 
over ghost fields. We see that they have no kinetic term, and we have 
chosen the overall nomalization of their action so that their mass is m g h, 
where m g h is an arbitrary mass parameter. Thus the momentum-space 
propagator for the ghosts is simply A (k 2 ) = 1 /rri 2 h . We also see that there 
is a ghost-ghost-scalar vertex, with vertex factor but there is no 

interaction between the ghosts and the vector field. 

This seems like a fairly convenient gauge for loop calculations, but there 
is a complication. The fact that the ghost propagator is independent of the 
momentum means that additional internal ghost propagators do not help 
the convergence of loop-momentum integrals. The same is true of vector- 
field propagators; from eq. (85.17) we see that, in momentum space, the 
propagator scales like 1 /AT 2 in the limit that all components of k become 
large. Thus, in unitary gauge, loop diagrams with arbitrarily many external 
lines diverge. This makes it difficult to establish renormalizability. 

A gauge that does not suffer from this problem is a generalization of 
R £ gauge (and in fact this name has traditionally been applied only to this 
generalization). We begin by using a cartesian basis for 99, 

99 = -^(v + h + ib) , (85.19) 

where h and b are real scalar fields. In terms of h and 6, the potential is 
V{p) = \\v 2 h 2 + | A vh{h 2 + b 2 ) + ±\(h 2 + b 2 f , (85.20) 

and the covariant derivative of p is 

Dfjip = -j= (dfjh + gbAfj) + *(£^6 - g(v+h)A,j,) . (85.21) 

Thus the kinetic term for <p becomes 

= -\{d + gbA M ) 2 - ±(d M & - g(v+h)A M ) 2 . (85.22) 
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Expanding this out, and rearranging, we get 

= -^dfjid^h - \d^bd^b - \g 2 v 2 A^A^ + gvA^d^b 

+ g A^ (hd^b - bd^h) 

- gvhA^A^ - \g 2 {h 2 + b 2 )A^A^ . (85.23) 

The first line on the right-hand side of eq. (85.23) contains all the terms 
that are quadratic in the fields. The first two are the kinetic terms for the 
h and b fields. The third is the mass term for the vector field. The fourth 
is an annoying cross term between the vector field and the derivative of b. 

In abelian gauge theory, in the absence of spontaneous symmetry break- 
ing, we fix gauge by adding to C the gauge-fixing and ghost terms 

£gf + £gh = 4r lG<2 - c ^ c , (85.24) 

where G = d IJ An , and 0(x) parameterizes an infinitesimal gauge transfor- 
mation, 

A^ Ap- d„0 , (85.25) 

ip — > (p — igOip . (85.26) 

With G = we have 5G/59 = — d 2 . Thus the ghost fields have no 

interactions, and can be ignored. 

In the presence of spontaneous symmetry breaking, we choose instead 

G = df'A,, - igvb , (85.27) 

which reduces to d^A^ when v = 0. Multiplying out G 2 , we have 

£gf = + gvb& J ’A ll - \ig 2 v 2 b 2 

= - gvA^b - \tg 2 v 2 b 2 , (85.28) 

where we integrated by parts in the first two terms to get the second line. 
Note that the second term on the second line of eq. (85.28) cancels the 
annoying last term on the first line of eq. (85.23). Also, the last term on 
the second line of eq. (85.28) gives a mass £}/ 2 M to the b field. 

We must still evaluate £ g h- To do so, we first translate eq. (85.26) into 

h — > h + g6b , (85.29) 

b — ■> b — g6{v + h) . (85.30) 


Then we have 


5G 

Je 


d 2 + ig 2 v{y + h) . 


(85.31) 
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From eq. (85.24) we see that the ghost lagrangian is 

£ g h = — c -d 2 + £g 2 v(v + h) c 

= —d^cd^c — 8,g 2 v 2 cc — t,g 2 vhcc . (85.32) 

We see from the second term that the ghost has acquired the same mass 
as the b field, £}/ 2 M. 

Now let us examine the vector field. Including £ g f, the terms in C that 
are quadratic in the vector field can be written as 

Co = [g^i-d 2 + M 2 ) + (1-r 1 )^] A v . (85.33) 

In momentum space, this reads 

Co = [(fc 2 + M 2 )g^ + (1-r 1 )^] A v (k) . (85.34) 

The kinematic matrix is 

. . . ] ={k 2 + M 2 )g ^ + (1-r 1 )^ 

= (k 2 + M 2 )(p^(k) + k^k u /k 2 ^ + 

= (k 2 + M 2 )P^ u {k) + C\k 2 + ZM 2 )Wk u /k 2 , (85.35) 

where P fiu (k) = g ^ — k^kC/k 2 projects onto the transverse subspace; 
P fiu (k) and k^k u /k 2 are orthogonal projection matrices. Using this fact, 
it is easy to invert eq. (85.35) to get the propagator for the massive vector 
field in R g gauge, 


A^(k) 


P^(k) 
k 2 + M 2 — ie 


£ k^k u /k 2 
k 2 + £M 2 — ie 


(85.36) 


We see that the transverse components of the vector field propagate with 
mass M, while the longitudinal component propagates with the same mass 
as the b and ghost fields, £ 1//2 M. 

Eq. (85.36) simplifies greatly if we choose £ = 1; then we have 


= (f = 11 - (85 ' 37) 


On the other hand, leaving £ as a free parameter allows us to check that all 
£ dependence cancels out of any physical scattering amplitude. Since their 
masses depend on £, the ghosts, the b field, and the longitudinal component 
of the vector field must all represent unphysical particles that do not appear 
in incoming or outgoing states. 
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To summarize, in R g gauge we have the physical h field with mass- 
squared = \\v 2 and propagator 1/(A; 2 + m|), the unphysical b field 
with propagator 1 /{k 2 + £M 2 ), the ghost fields c and c with propagator 
1 f(k 2 + £M 2 ), and the vector field with the propagator of eq. (85.36). For 
external vectors, the polarizations are still given by eq. (85.13), and obey 
the sum rules of eq. (85.16). The mass parameter M is given by M = gv. 
The interactions of these fields are governed by 

A = -\X vh{h 2 + b 2 ) - ±\{h 2 + b 2 ) 2 

+ gA^(hd^b - bd^h) - gvhA^A^ - \g 2 {h 2 + b 2 )A^A^ 

— 8 > g 2 vhcc . (85.38) 

It is interesting to consider the limit £ — > oo. In this limit, the vector 
propagator in R ^ gauge, eq. (85.36), turns into the massive vector propa- 
gator of eq. (85.17), 


K^{k) 


g nv + k n k ^/ M 2 

k 2 + M 2 — ie 


(? = oo) . 


(85.39) 


The b field becomes infinitely heavy, and we can drop it. (Equivalently, 
its propagator goes to zero.) The ghost fields also become infinitely heavy, 
but we must be more careful with them because their interaction term, the 
last line of eq. (85.38), also contains a factor of £. The vertex factor for this 
interaction is —i^g 2 v = — i(^M 2 )v~ l . Note that this is the same vertex 
factor that we found in unitary gauge for the ineraction between the p field 
and the ghost fields; see eq. (85.18) and take ?n 2 h = £M 2 . Thus we cannot 
drop the ghost fields, but we can take their propagator to be l/m 2 h rather 
than 1 /(k 2 + m 2 h ), since k 2 <C m 2 h = £M 2 in the limit £ — ► oo. This is 
the ghost propagator that we found in unitary gauge. We conclude that 
R £ gauge in the limit £ — > oo is equivalent to unitary gauge. Of course, in 
this limit, we reencounter the problems with divergent diagrams that led us 
to consider alternative gauge choices in the first place. For practical loop 
calculations, R £ gauge with £ = 1 is typically the most convenient. 

In the next section, we consider R g gauge for nonabelian theories. 
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86 Spontaneously Broken Nonabelian Gauge 

Theory 

Prerequisite: 85 

In the previous section, we worked out the lagrangian for a U(l) gauge 
theory with spontaneous symmetry breaking in R £ gauge. In this section, 
we extend this analysis to a general nonabelian gauge theory. 

As in section 85, it will be convenient to work with real scalar fields. 
We therefore decompose any complex scalar fields into pairs of real ones, 
and organize all the real scalar fields into a big list fa, i = 1 , ,N. These 
real scalar fields form a (possibly reducible) representation R of the gauge 
group. Let T a be the gauge-group generator matrices that act on 0; they 
are linear combinations of the generators of the SO(IV) group that rotates 
all components of 4>i into each other. Because these SO(N) generators 
are hermitian and antisymmetric, so are the T a ’s. Thus i(T a )ij is a real, 
antisymmetric matrix. 

The lagrangian for our theory can now be written as 

C = -\D^D^ - V (cf>) - \F a ^F; u , (86.1) 

where 

{Dn<t>)i = - ig a A°(T a ) i:j <l>j (86.2) 

is the covariant derivative, and the adjoint index a runs over all generators 
of all gauge groups. Because 4>i and A“ are real fields, and i(T a )ij is a real 
matrix, (D^(f>)i is real. 

Now we suppose that the potential V(4>) is minimized when (j) has a 
VEV 

(0| </>;(£) 1 0) = Vi . (86.3) 

A generator T a is unbroken if ( T a )ijVj = 0, and broken if ( T a )^Vj ^ 0. 

Each broken generator results in a massless Goldstone boson. To see 
this, we note that the potential must be invariant under a global gauge 
transformation, 

V((l-i6 a T a )(j)) = V{(f>) . (86.4) 

Expanding to linear order in the infinitesimal parameter 9, we find 

^-(T a ) jk( j) k = 0. (86.5) 

d( h 

We differentiate eq. (86.5) with respect to 4> k to get 
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Now set <f>i = vp, then dV/d<j>i vanishes, because </>* 
Also, we can identify 


d 2 V 


ym )ij = 




Pi=Vi 


Vi minimizes V(<j)). 


(86.7) 


as the mass-squared matrix for the scalars (after spontaneous symmetry 
breaking). Thus eq. (86.6) becomes 


{m 2 )ij{r a v)j = 0 . 


(86.8) 


We see that if T a v ^ 0, then T a v is an eigenvector of the mass-squared 
matrix with eigenvalue zero. So there is a zero eigenvalue for every linearly 
independent broken generator. 

Let us write 

M x ) = Vi + Xi(x) , (86.9) 

where Xi is a real scalar field. The covariant derivative of 0 becomes 


(D^i = d^Xi ~ igaA 1 {T a )ij{v + x)j (86.10) 

It is now convenient to define a set of real antisymmetric matrices 


(T a )ij = iga(T a )ij , ( 86 . 11 ) 

and the real rectangular matrix 

F a i = (r a ) ijVj . ( 86 . 12 ) 


We can now write 


(D^)i = d^Xi - A;(F a + r a X )i • (86.13) 

The kinetic term for cj) becomes 

-\D^D^ = -\d^ X id„Xi ~ \(F a iF b i)A a ^ + F a iA a ^ X i 

+ Alxi^h^Xj - A a ^F a i(T b ) ijXj 

- \A^Alxi{r a T b ) ijXj • (86.14) 

We see (from the second term on the right-hand side) that the mass-squared 
matrix for the vector fields is 

(M 2 ) ab = F a iF b i = ( FF T ) ab . (86.15) 

A theorem of linear algebra states that every real rectangular matrix can 
be written as 

F a i = S ab (M b 5 b j )R ji , 


(86.16) 
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where S and R are orthogonal matrices, and the diagonal entries M a are 
real and nonnegative. From eq. (86.15) we see that these diagonal entries 
are the masses of the vector fields. The vector fields of definite mass are 
then given by A“ = S ba A b ^. 

Now we are ready to fix R % gauge. To do so, we add to C the gauge- 
fixing and ghost terms 

t)G a 

C g{ +C gh = ^r 1 G a G a -^ lw c b , 

where we choose 

G a = &AI - tF\ Xi . 

Then we have 

£ gf = -\C l d^A a u + F^x^A* - ^(F a ,F a 

= - F\A a ^ X i ~ ^( Fa iF a j)XiXj • (86.19) 

We integrated by parts in the first two terms to get the second line. Note 
that the second term on the second line of eq. (86.19) cancels the annoying 
last term on the first line of eq. (86.14). Also, the last term on the second 
line of eq. (86.19) makes a contribution to the mass-squared matrix for the 
^ fields, 

aM%j = ZF'iF*, = ar T F)ij • ( 86 . 20 ) 

Eq. (86.16) tells us that the eigenvalues of this matrix are £ 1//2 M a , where 
M a are the vector-boson masses. The mass-squared matrix £M 2 should 
be added to the mass-squared matrix vn? that we get from the potential, 
eq. (86.7). Note that eqs. (86.8) and (86.12) imply that ( m 2 )ijF a j = 0; 
eq. (86.20) then yields (■ m 2 )ij(£M 2 )jk = 0. Thus these two contributions to 
the mass-squared matrix of the scalar fields live in orthogonal subspaces. 
The scalar fields of definite mass are Xi = RijXji where the block of R in the 
to 2 subspace is chosen to diagonalize to 2 . The m 2 subspace consists of the 
physical, massive scalars, and the £M 2 subspace consists of the unphysical 
Goldstone bosons; these are the fields that would be set to zero in unitary 
gauge. 

We must still evaluate £ g h- To do so, we recall that 9 a (x ) parameterizes 
an infinitesimal gauge transformation, 

A“ ^ A“ - D ab 9 b , (86.21) 

Xi -> -0 a {T a )ij(v + x)j ■ (86.22) 


(86.17) 

(86.18) 


5G a 

W 


Thus we have 


d^D; b + ^F a j (r b ) jk (v + x)k 
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= - d »D ab + £F a jF b j + iF a j{T b ) jk Xk 
= -d»D ab + i{M 2 ) ab + £ F a j(T b ) jk Xk , (86.23) 

and so the ghost lagrangian is 

£ gh = -d^(FD ab c b - i{M 2 ) ab (Fc b - ^F a j {T b ) jkX k^c b . (86.24) 

The ghost fields of definite mass are c a = S ba c b and c a = S ba c b . 

The complete gauge-fixed lagrangian is now given by eqs. (86.1), (86.14) 
(86.19), and (86.24). We can rewrite it in terms of the fields of definite mass. 
This results in the replacements 


F a i - 

■* M a S a i , 

(86.25) 

( T a )ij - 

■* S ab (R T T b R) rj , 

(86.26) 

jabc 

gad gbe gcg jdeg 

(86.27) 


throughout C. The Feynman rules then follow in the usual way. 

Problems 

86.1) Let ipi be a complex scalar field in a complex representation R of the 
gauge group. Under an infinitesimal gauge transformation, we have 
S(pi = -i0 a (Tg)i j (pj. Let us write w = -^{fa+ifa+d(R)) , where fa is a 
real scalar field with the index i running from 1 to 2d(R). Then, under 
an infinitesimal gauge transformation, we have 5 fa = —i9 a {T a )ij(j)j. 

a) Express T a in terms of the real and imaginary parts of T“. 

b) Show that the T a matrices satisfy the appropriate commutation 
relations. 
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87 The Standard Model: Gauge and Higgs 

Sector 

Prerequisite: 84 


We now turn to the construction of the Standard Model of elementary 
particles, also called the Glashow-Weinberg-Salam model. This is the com- 
plete (except for gravity) quantum field theory that appears to describe our 
world. It can be succinctly specified as a gauge theory with gauge group 
SU(3) x SU(2) x U(l), with left-handed Weyl fields in three copies of the 
representation (1, 2, — ^)©(1, 1, +1)©(3, 2, +|)©(3, 1, — §)©(3, 1, +|), and 
a complex scalar field in the representation (1,2, — \). Here the last entry 
of each triplet gives the value of the U(l) charge, known as hypercharge. 
The lagrangian includes all terms of mass dimension four or less that are 
allowed by the gauge symmetries and Lorentz invariance. 

We will construct the Standard Model over several sections. We begin 
with the electroweak part of the gauge group, SU(2)xU(l), and the complex 
scalar field <p, known as the Higgs field , in the representation (2, — ^). The 
Higgs field acquires a nonzero VEV that spontaneously breaks SU(2) x U(l) 
to U(l); the unbroken U(l) is identified as electromagnetism. 

We begin with the covariant derivative of the Higgs field ip, 


(D/j,<p)i = d^i - i[g 2 A“ L T a + giB^fipj , (87.1) 


where T a = \o a and Y = — \l\ Y is the hypercharge generator. It will 
prove useful to write out g 2 A^T a + g\B^Y in matrix form, 


g 2 A“T a + g^Y 


1 f 92 A* - g 1 B fl 
^ \52Mi + iA 2 ^) 


Now suppose that <p has a potential 


<72 (a ‘ - 1 a : 2 ) \ 

-92 A], -giB^J 


(87.2) 


V(<p) = |A(pV- \v 2 f . (87.3) 


This potential gives a nonzero VEV. We can make a global gauge trans- 
formation to bring this VEV entirely into the first component, and further- 
more make it real, so that 


(o|vG)|o> 


v /2 1 0 


(87.4) 


The kinetic term for ip is —(D^ip^D^ip. After replacing ip by its VEV, we 
find a mass term for the gauge fields, 


£ 


mass 


/ 92 A* - giB M g 2 (A) J - iA £) \ 2 / 1 

\92{A] 1 + iAl) -g 2 Al~ giB^J \ 


0 


(87.5) 
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To diagonalize this mass-squared matrix, we first define the weak mixing 
angle 

(87.6) 


and the fields 


0w = tan 1 (gi/g 2 ) , 

Z/l — — Sy/Bft , 


(87.7) 

(87.8) 

(87.9) 


An — s w A^ + c w B fi , 
where s w = sin 9 W , c w = cos0 w . In terms of these fields, eq. (87.5) becomes 

■ V2W- ... J Vo. 


Cr, 


= -hWii, o) 


= -(g 2 v/2) 2 W + ^W~ - \{g 2 v/2c w ) 2 Z^Z^ 


= -MlW + "W- - ^MlZ'-Z,, , 

(87.10) 

where we have identified 


M w = g 2 v/2 , 

(87.11) 

M z = M w /cos0 w . 

(87.12) 


The observed masses of the and Z° particles are M w = 80.4 GeV and 
M z = 91.2 GeV. Eq. (87.12) then implies cosf? w = 0.882, or, as it is more 
usually expressed, sin 2 # w = 0.223. 1 

Note that the A t , field remains massless; this signifies that there is an 
unbroken U(l) subgroup. We will identify this unbroken U(l) with the 
gauge group of electromagnetism. 

Before introducing leptons and quarks (which we do in sections 87 and 
88), let us work out the complete lagrangian for the gauge and Higgs fields, 
in unitary gauge. This is sufficient for tree-level calculations. 

The two complex components of the g> field yield four real scalar fields; 
three of these become the longitudinal components of the IV ^ and Z°. The 

x Of course, this number is only meaningful once a renormalization scheme has been 
specified. We are implicitly using an on-shell scheme in which 9 W is defined by the 
relation cos# w = M w /M z , where A/ w and M z are the actual particle masses. The 
relation g\ — <72 tan 6 W is then subject to loop corrections that depend on the precise 
definitions adopted for gi and 52 . In the MS scheme, on the other hand, 9w is defined 
by eq. (87.6), and for g = M z , we have sin 2 9 W = 0.231. 
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remaining scalar field must be able to account for shifts in the overall scale 
of p. Thus we can write, in unitary gauge, 


¥>(®) 


1 

71 


' v + H(x ) ' 
0 


(87.13) 


where H is a real scalar field; the corresponding particle is the Higgs boson. 
The potential now reads 

V(<p) = \\v 2 H 2 + |A vH 3 + ±\H a . (87.14) 

We see that the mass of the Higgs boson is given by = \\v 2 . (As of 
this writing, the Higgs boson has not been observed; the lower limit on its 
mass is m H > 115 GeV.) The kinetic term for H comes from the kinetic 
term for ip, and is the usual one for a real scalar field, —\d^Hd^H . Finally, 
recall that the mass term for the gauge fields, eq. (87.10), is proportional 
to v 2 . Hence it should be multiplied by a factor of (1 + v~ l H) 2 . 

Now we have to work out the kinetic terms for the gauge fields. We 


have 

r — I Tpafii'T^a 1 TDfiJAf) 

^ — 4 1 1 [11/ 4 ^ 5 

(87.15) 

where 

F lu = d u A l - 9 vA\ l + g 2 (AlAl - A 2 u Al) , 

(87.16) 


F 2 u = d^Al - d v A\ + g 2 {Al Al - A ®Aj) , 

(87.17) 


F jlv = 9n A l ~ 9uAj \ + g 2 (A A L Ai — A a A 2 ^) , 

(87.18) 


B, 1V = d fl B u - d u B lt . 

(87.19) 

Next, form the combinations F A U ± iF 2 u . Using eq. (87.7), 

we find 


7 = 2 {Fl„ ~ iF 2 u ) = D,W4 - D V W; , 

(87.20) 


-^(F A U + iF 2 u ) = D\W~ - DtW ~ , 

(87.21) 


where we have defined a covariant derivative that acts on Wit, 

h 1 

Dfj, = dp, ig 2 Al 

= d^- ig 2 (s w A /1 + c w Z M ) . (87.22) 


If we identify A /t as the electromagnetic vector potential, and assign electric 
charge Q = +1 (in units of the proton charge) to the W + , then we see from 
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eq. (87.22) that we must identify the electromagnetic coupling constant e 
as 

e = (j 2 sin 9 W . (87.23) 

Here we are adopting the convention that e is positive. (In our treatment 
of quantum electrodynamics, we used the convention that e is negative, but 
that is less convenient in the present context.) 

We also have 

= d„Al - dyA% - ig 2 (W+W- - W+W~) 

= s w F^ v + CwV - ig 2 {W+W~ - W+W ~ ) , (87.24) 

— Cy/F^y , (87.25) 

where F^ u = d^Ay — d u A M is the usual electromagnetic field strength, and 

= d^Zy - dyZ M (87.26) 

is the abelian field strength associated with the Z ]L held. 

Now we can assemble all of this into the complete lagrangian for the 
electroweak gauge fields and the Higgs boson in unitary gauge. We will 
express g 2 in terms of e and 9 W via g 2 = e/sin 9 W , and A in terms of rn H 
and v via A = 2?n|/u 2 . We ultimately get 

C = - IZ^Z^y - D^W~ v D ll W+ + D^W~ U D U W+ 

+ ie{F^ + cot OyrZ^W+W- 

- i(e 2 /sin 2 dy,)(W +li W~W +v W~ - W +ti W+W~ v W~) 

- (MlW^W~ + ±M|Z^)(1 + v~ x Hf 

- ^d^Hd^H — \m\ H 2 — — |m|u _2 /7 4 , (87.27) 

where 

D n = d^, ie(A^ + cot 0 W Z M ) . (87.28) 

With the W+ held assigned electric charge Q = +1, this lagrangian exhibits 
manifest electromagnetic gauge invariance. The full underlying SU(2) x 
U(l) gauge invariance is not manifest, however, because we have hxed uni- 
tary gauge. 


Reference Notes 

Discussions of the Standard Model in R ^ gauge can be found in Cheng & 
Li and Ramond II. 


Problems 
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87.1) Find the generator Q of the unbroken U(l) subroup as a linear com- 
bination of the T a ’s and Y . 


87.2) a) Ignoring loop corrections, find the numerical values of v, g \ , and 
(j 2 ■ Take e 2 /4ir = a(M z ) = 1/127.9 and sin 2 0 W = 0.231. 

b) The Fermi constant is defined (at tree level) as 


G f 


4a/ 2 sin 2 0 w M%, 


(87.29) 


Find its numerical value in GeV 2 . 
c) Express G f in terms of v. 

87.3) In this problem we will work out the generator matrices introduced 
in section 86 for the case of the Standard Model. 

a) Write the Higgs field as 


1 /(/>!+ ifo \ 

v/2 \ 4>2 + i(t>4 ) 


(87.30) 


where cj)i is a real scalar field. Express the SU(2) generators T a and 
the hypercharge generator Y as 4 x 4 matrices T a and y that act on 
i f>i. Hint: see problem 86.1. 

b) Compute the matrix F a i, defined in eq. (86.12). 

c) Compute the mass-squared matrix for the vector fields, (M 2 ) ah = 
F a iF b i , and find its eigenvalues. 


87.4) Work out the Feynman rides for the lagrangian of eq. (87.27). 

87.5) Assume that m H > 2 M z , and compute (at tree level) the decay rate of 
the Higgs boson into W + W~ and Z°Z° pairs. Express your answer 
in GeV for m H = 200 GeV. 
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88 The Standard Model: Lepton Sector 

Prerequisite: 75, 87 

Leptons are spin-one-half particles that are singlets of the color group. 
There are six different flavors of lepton; see Table 2. The six flavors are 
naturally grouped into three families or generations: e and v e , p and 
r and v T . 

Let us begin by describing a single lepton family, the electron and its 
neutrino. We introduce left-handed Weyl fields t and e in the representa- 
tions (2, — |) and (1,+1) of SU(2) x U(l). Here the bar over the e in the 
field e is part of the name of the field, and does not denote any sort of 
conjugation. The covariant derivatives of these fields are 

(iV)i = ~ m ^XT a )Hj - i gi (-l)B^i , (88.1) 

D^e = <9 M e - ig x (+1 )B fJ e , (88.2) 

and their kinetic terms are 

£kin = i&d^fiD^i + ie^d^D^e . (88.3) 

The representation (2,— ^)©(1,+1) for the left-handed Weyl fields is com- 
plex; hence the gauge theory is chiral, and therefore parity violating. 

We cannot write down a mass term involving t and/or e because there 
is no gauge-group singlet contained in any of the products 

(2,-i) 0 (2,— i), 

(2,4) 0 (1,+1) , 

(1,+1) 0 (1,+1) • (88.4) 

However, we are able to write down a Yukawa coupling of the form 

-Cyuk = -ye^tpitje + h.c. , (88.5) 

where (p is the Higgs field in the (2, — ^) representation that we introduced in 
the last section, and y is the Yukawa coupling constant. A gauge-invariant 
Yukawa coupling is possible because there is a singlet on the right-hand 
side of 

(2, -±) 0 (2, 4) 0 (1, +1) = (1, 0) © (3, 0) . (88.6) 

There are no other gauge-invariant terms involving t or e that have mass 
dimension four or less. Hence there are no other terms that we could add 
to C while preserving renormalizability. 

We add eqs. (88.3) and (88.5) to the lagrangian for and the gauge 
fields that we worked out in the last section. In unitary gauge, we replace 
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name 

symbol 

mass 

(MeV) 

Q 

electron 

e 

0.511 

-l 

electron neutrino 

v e 

0 

0 

muon 

y 

105.7 

-l 

muon neutrino 


0 

0 

tau 

T 

1777 

-1 

tau neutrino 

V T 

0 

0 


Table 2: The six flavors of lepton. Q is the electric charge in units of 
the proton charge. Each charged flavor is represented by a Dirac field, each 
neutral flavor by a Majorana field (or, equivalently, a left-haned Weyl field). 
Neutrino masses are exactly zero in the Standard Model. 


ipi with A=(v+H), where H is the real scalar field representing the physical 
Higgs boson, and with zero. The Yukawa term becomes 

-Cvuk = ~^V( V + H)(£ 2 e + h.c.) . (88.7) 

It is now convenient to assign new names to the SU(2) components of l, 

e= [[)- ( 88 - 8 ) 

(We will rely on context to distinguish the field e from the electromagnetic 
coupling constant e.) Then eq. (88.7) becomes 

Auk = ~^y( v + H)(ee + eV) 

= ~^y(v + H)SS (88.9) 

where we have defined a Dirac field for the electron. 


£ = 



We see that the electron has acquired a mass 


m e = 


yy 

v/2 ' 


( 88 . 10 ) 


(88.11) 


The neutrino has remained massless. 
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We can describe the neutrino with a Majorana field 

v= t)- (8fU2) 

However, it is often more convenient to work with 

AA l = P l AA=Q, (88.13) 

where P L = ^( 1 — 75 ). We can think of A/" L as a Dirac field; for example, the 
neutrino kinetic term iu^d^d^u can be written as iAT L ^A/" L . 

Now we return to eqs. (88.1) and (88.2), and express the covariant 
derivatives in the terms of the Z^, and A M fields. From our results in 
section 87, we have 

92 a;t' +92 ^ = ^(J_ ”;) ( 8814 > 

and 

<72 ^T 3 + giB^y = + c w Z /i )T 3 + -^-{cy/An — s w Z^)Y 

= e{A^ + cot 9 W Z^)T 3 + e(A^ - tan 9 W Z I1 )Y 
= e(T 3 + Y)A fl + e(cot 9 W T 3 — tan 9 w Y)Z fl . (88.15) 


Since we identify A M as the electromagnetic field and e as the electromag- 
netic coupling constant (with the convention that e is positive) , we identify 


Q = T 3 + Y 

as the generator of electric charge. Then, since 

(88.16) 

T'V = + \u , 

T 3 e = -±e, 

T 3 e = 0 , 

(88.17) 

Yu = — \u , 

we see from eq. (88.16) that 

Ye = — \e , 

Ye = +e , 

(88.18) 

II 

0 

Qe = — e , 

Qe = +e . 

(88.19) 


This is just the set of electric charge assignments that we expect for the 
electron and the neutrino. Then (since the action of Q on the fields is more 
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familiar than the action of Y) it is convenient to replace Y in eq. (88.15) 
with Q — T 3 . We find 

, 92 - 4 /jT 3 + giB^Y = eQA M + e[(cot 9 W + tan 9 W )T 3 - tan 9 W Q\Z^ 

= eQA + s^(T 3 - slQ)Z^ . (88.20) 

In terms of the four-component fields, we have 

{ 92 AIT 3 + giBfjY)£ = — eA^ + Sw e Cw + s%,)Z^ £ , (88.21) 

( 92 A 3 T 3 + ffl £? M Y)A/- L = ^(+i)^ A7 L . (88.22) 

Using eqs. (88.14) and (88.21-88.22) in eqs. (88.1-88.3), we find the coup- 
ings of the gauge fields to the leptons, 

Ant = ^92Wp~^ + -^92W;J + » + s^Z,X + eApJZ M , (88.23) 


where we have defined the currents 

J +fl = , (88.24) 

, (88.25) 

Jz = J3 — s w^em j (88.26) 

4 = ±^ 7 ^ - i? L 7 ^ L , (88.27) 

Jg M = -£^£ . (88.28) 


In many cases, we are interested in scattering amplitudes for leptons 
whose momenta are all well below the W± and Z° masses. In this case, we 
can integrate the and Z lx helds out of the path integral, as discussed 
in section 29. We get the leading term (in a double expansion in powers 
of the gauge couplings and inverse powers of M w and M z ) by ignoring the 
kinetic energy and other interactions of the W± and Z jx fields, solving the 
equations of motion for them that follow from £ mass + Ant, where £\ n \, is 
given by eq. (88.23) and 

Anass = -MlW+AV- - l 2 M'iZ"Z lt , (88.29) 

and finally substituting the solutions back into £ mass + Ant • This is equiv- 
alent to evaluating tree-level Feynman diagrams with a single W± or Z° 
exchanged, with the propagator g 1 '" /M'^ vz . The result is 

2 2 
r „ — a ' 2 j+e- j- a £ j 

US ~2MI J A + 2 .s^Mz 2ZZ " 


2 slMl 


{J + M~ + J ZM ) 


= 2v / 2G F (J +/i .7 / 7 + .A.7 Z/i ) . 


(88.30) 
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Figure 88.1: Feynman diagram for muon decay. The wavy line is a IF 
propagator. 


We used e = g 2 sin 9 W and M w = M z cos 9 W to get the second line, and we 
defined the Fermi constant 

e 2 

G f = — -= ^ (88.31) 

4\/2 sin 2 (9 w M^ 

in the third line. We can use C e $ to compute the tree-level scattering 
amplitude for processes like v e e~ —> v e e~\ we leave this to the problems. 

Having worked out the interactions of a single lepton generation, we 
now examine what happens when there is more than one of them. Let us 
consider the fields and e 7 , where I = 1,2,3 is a generation index. The 
kinetic term for all these fields is 

£kin = + ie^D^er , (88.32) 

where the repeated generation index is summed. The most general Yukawa 
term we can write down now reads 


£y u k = -£ lj Vitjiyuej + h.c. , (88.33) 

where y u is a complex 3x3 matrix, and the generation indices are summed. 
We can make unitary transformations in generation space on the fields: 
i I — > Ljjij and e t —> E u ej , where L and E are independent unitary 
matrices. The kinetic terms are unchanged, and the Yukawa matrix y is 
replaced with L T yE. We can choose L and E so that L T yE is diagonal with 
positive real entries y,. The charged leptons then have masses m ei = 
y^/y/2, and the neutrinos remain massless. In the currents, eqs. (88.24- 
88.28), we simply add a generation index I to each field, and sum over 
it. 

Let us work out the details for one process of particular importance: 
muon decay, — > e~T/ e v tl . Let the four-component fields be £ for the 

electron, J\4 for the muon, JV e for the electron neutrino, and J\f m for the 
muon neutrino. Only the charged currents J± are relevant; the neutral 
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current J% and the electromagnetic current J RM do not contribute. Ignoring 
the r terms in the charged currents, we have 

J +M = £^A U + , (88.34) 

J ■"'* = . (88.35) 

The relevant term in the effective interaction is 

£ eff = 2V2G F (£ h ^Af eL ){Jf mL ^M h ) . (88.36) 

This can be simplified by means of a Fierz identity (see problem 36.3), 

C eS = -4^G f (!M c P l AQ(TP r AO . (88.37) 

Assigning momenta as shown in fig. (88.1), and using the usual Feynman 
rules for incoming and outgoing particles and antiparticles, the scattering 
amplitude is 

T = -W2G ¥ {ulCP^ 2 ){u' z P n Cu^) 

= -4v^G F (viP L ^)(«^P R ui) . (88.38) 

Taking the complex conjugate, and using P h = P R , we find 

T* = -Ay/2G F (v , 2 P R v 1 )(v , 1 P Ij u , 3 ) . (88.39) 

Multiplying eqs. (88.38) and (88.39), summing over final spins and averag- 
ing over the initial spin, we get 

(\T\ 2 ) = i(4v / 2) 2 G|TV[(-V 1 -m M )P L (-/ 2 )P R ] 

x Tr[(-/ 3 +m e )P R (-/ 1 )P L ] . (88.40) 

The traces are easily evaluated, with the result 

(|T| 2 ) = 64G2(p 1 p')(p' 1 p'). (88.41) 

We get the decay rate T by multiplying (|T| 2 ) by dLIPS 3 (pi) and integrat- 
ing over p'i 2 3 . We worked out the result (in the limit m e <C m /t ) in problem 
11.3, 

Gp m\ 

r = -^-4 . 88.42) 

192t r 3 v ’ 

After including one-loop corrections from electromagnetism, and account- 
ing for the nonzero electron mass, the measured muon decay rate is used 
to determine the value of G F , with the result Gp = 1.166 x 10 -5 GeV -2 . 
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Reference Notes 

Lepton phenomonology is covered in more detail in in Cheng & Li , Georgi, 
Peskin & Schroeder , Quigg, and Ramond II. 

Problems 

88.1) Verify the claim made immediately after eq. ( 88 . 6 ). 

88.2) Show that a neutrino always has negative helicity, and that an an- 
tineutrino always has positive helicity. Hint: see section 75. 

88.3) Show that the sum of eqs. (88.32) and (88.33), when rewritten in terms 
of fields of definite mass, has a global symmetry U(l) x U(l) x U(l). 
The corresponding charges are called electron number , muon number , 
and tau number, the sum of the charges is the lepton number. List 
the value of each charge for each Dirac field £j and A/" L/ . 

88.4) Compute (|T| 2 ) for muon decay using eq. (88.36), without making the 
Fierz transformation to eq. (88.37), and verify eq. (88.41). 

88.5) a) Write down the term in C c g- that is relevant for and zz^e - — > . 

Express your answer in the form 

Ces = ^G F jr^(l-^ 5 )M£^(C v -C Al5 )£ , (88.43) 

where M is the muon neutrino field, and determine the values of C v 
and C A . 

b) Repeat part (a) for u e e~ — > v e e~ . 

c) Compute (|T| 2 ) as a function of the Mandelstam variables, the 
electron mass, and C v and C A . 

88 . 6 ) Compute the rates for the decay processes W + —> e + v e , Z° — ► e + e~, 
and Z° —> v e v e . Neglect the electron mass. Express your results in 
GeV. 

88.7) Anomalous dimension of the Fermi constant. The coefficient of the 
effective interaction for muon decay, eq. (88.36), is subject to renor- 
malization by quantum electrodynamic processes. In particular, we 
can compute its anomalous dimension 7 q, defined via 

p—Gp(p) = 7 g(o)Gf(^) , (88.44) 

dp 

where a = e 2 / 47 r is the fine-structure constant in the MS scheme with 
renormalization scale p. 
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a) Argue that it is G F (M W ) that is given by eq. (88.31). 

b) Multiply eq. (88.36) by a renormalzing factor Z G , and define 

In (Z G /Z 2 ) = jr , (88.45) 

n= 1 £ 

where Z 2 is the renormalizing factor for a field of unit charge in spinor 
electrodynamics. Show that 

7 G (a) = otQ[{a) . (88.46) 


c) If 7 G (a) = ci a + 0(a 2 ) and /3(a) = h\a 2 + 0(a 3 ), show that 

a(/i) ~\ Cl ! bl 


Gf(p) = 


a (My, 


Gf (My 


(88.47) 


for p < M w . (For p > M w , we should not be using an effective 
interaction.) 

d) If a(p) ln(M w /p) <C 1, show that eq. (88.47) becomes 


G F (p) 


1 — cia(^) \n(M w /p) Gp(My 


(88.48) 


e) Use a Fierz identity to rewrite eq. (88.36) in charge retention form , 

£ cff = 2V2 Z G G F (£ h Y l M L )(Af m ^Ke) • (88.49) 

f) Consider the process of muon decay with an extra photon con- 
necting the p and e lines. Work in Lorenz gauge, and with the four- 
fermion vertex provided by eq. (88.49). Use your results from problem 
62.2 to show that, in this gauge, there is no 0(a ) contribution to Z G 
in the MS scheme. 

g) Use your result from part (d), and your result for Z -2 in Lorenz 
gauge from problem 62.2, to show that ci = 0, and hence that 
Gp(p) = G F (M W ) at the one-loop level. 
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89 The Standard Model: Quark Sector 

Prerequisite: 88 

Quarks are spin-one-half particles that are triplets of the color group. There 
are six different flavors of quark; see Table 1 in section 83. The six flavors 
are naturally grouped into three families or generations : u and d, c and s, 
t and b. 

Let us begin by describing a single quark family, the up and down 
quarks. We introduce left-handed Weyl fields q , u, and d in the represen- 
tations (3, 2, +g), (3, 1, — |), and (3, 1, +^) of SU(3) x SU(2) x U(l). Here 
the bar over the letter in the fields u and d is part of the name of the field , 
and does not denote any sort of conjugation. The covariant derivatives of 
these fields are 


(Df,q)ai = dfiQai - qpi - i^A^Tf)^ qgj 

(89.1) 

{D^u) a = d^vT - ig 3 A“(T$) a puP - ic/i(-|)R M u a , (89.2) 

(D^df = d,J a - ig :i A;jTf) a 0cP - i gi {+\)B^d a . (89.3) 

We rely on context to distinguish the SU(3) gauge fields from the SU(2) 
gauge fields. The kinetic terms for q, u, and d are 

£kin = iq^d^iD^ai + iu ] a d^{D^u) a + id ] a d^ L (D ^d) a . (89.4) 

The representation (3,2,+g) © (3,1,— |) © (3,1,+^) for the left-handed 
Weyl fields is complex; hence the gauge theory is chiral, and therefore 
parity violating. 

We cannot write down a mass term involving q, u, and/or d because 
there is no gauge-group singlet contained in any of the products of their 
representations. But we are able to write down Yukawa couplings of the 
form 

-Cvuk = -y'£ lJ Viqajd a ~ y”^ l q a iU a + h.c. , (89.5) 

where (f is the Higgs field in the (1, 2, — ^) representation that we introduced 
in section 87, and y' and y" are the Yukawa coupling constants. These 
gauge-invariant Yukawa couplings are possible because there are singlets 
on the right-hand sides of 

(l,2,-i)®(3,2,+i)®(3,l,+i) = (1,1,0)®... , 


(l,2,+±)<g)(3,2,+±)<g>(3,l,— §) = (1,1,0)©... • 


(89.6) 

(89.7) 
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There are no other gauge-invariant terms involving q, u, or d that have 
mass dimension four or less. Hence there are no other terms that we could 
add to £ while preserving renormalizability. 

In unitary gauge, we replace (p i with + H), where H is the real 
scalar field representing the physical Higgs boson, and tp 2 with zero. The 
Yukawa term becomes 

£y u k = + H)q a2 d Q - ^ y"(v + H)q al u Q + h.c. . (89.8) 

It is now convenient to assign new names to the SU(2) components of q , 


9 =UJ' <89 - 9) 

Then eq. (89.8) becomes 

£y u k = “7 ^v\ v + H)(d a d a + S a S a ) - y"(v + H)(u a u a + « ta ) 

= ~^y\v + H)V a V a - ±y'\v + H)TtU a , (89.10) 

where we have defined Dirac fields for the down and up quarks, 


V 


a — 




(89.11) 


We see from eq. (89.10) that the up and down quarks have acquired masses 


m d = 


y'y 

v/2 


m u = 


y% 

V2 


(89.12) 


Now we return to eqs. (89.1-89.3), and express the covariant derivatives 
in the terms of the W±, Z and A ^ fields. From our results in section 88, 
we have 


g 2 AlT 1 +g 2 AlT 2 


92 f 0 

V2\W~ 



g 2 AlT 3 + gi B^Y = eQA, + ^(T 3 - s%Q)Z» , 

where 

Q = T 3 + Y 

is the generator of electric charge. Then, since 


(89.13) 

(89.14) 

(89.15) 


T 3 u = +\u , T 3 d = — \d , T 3 u = 0 , T 3 d = 0 , (89.16) 

Yu = +^u, Yd = +\d , Yu = —^u, Yd = +^d, (89.17) 


we see from eq. (89.15) that 
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Qu = +§rt Qd = —\d , Qu = — §u , Qd = +^d . (89.18) 

This is just the set of electric charge assignments that we expect for the up 
and down quarks. In terms of the four-component fields, we have 

(52^T 3 + giB^Y)U = +| eAfj, + Sw 6 Cw (+|-Pl - § U , (89.19) 

(d2^T 3 + giB^Y )V = + Sw e Cw (— ^-P L + jSw)^ ^ • (89.20) 

Using eqs. (89.13) and (89.19-89.20) in eqs. (89.1-89.4), we find the coup- 
ings of the electroweak gauge fields to the quarks, 

/lint = + ^g 2 W-J + ^ + + eAXu , (89.21) 


where we have defined the currents 

•/ +,t = VXUr , (89.22) 

= Z7l7^l , (89.23) 

4 = 4- sl4 M , (89.24) 

4 = - /P l7 ^ l , (89.25) 

4 m = +\U^U - \V4V . (89.26) 


Having worked out the interactions of a single quark generation, we 
now examine what happens when there is more than one of them. Let us 
consider the fields ttj, and d 7 , where I = 1,2,3 is a generation index. 
The kinetic term for all these fields is 

/I kin = iq^ mI d fl {D fi ) a 4qp jI + + idli^iDXpd'i , 

(89.27) 

where the repeated generation index is summed. The most general Yukawa 
term we can write down now reads 

Zlvuk = -£ tJ TiQajiy'iX ~ V ] \aiiy"jU a j + h.c. , (89.28) 

where y' u and y'{j are complex 3x3 matrices, and the generation indices 
are summed. In unitary gauge, this becomes 

Zlvuk = ~^( v + H ) d aiy'udj ~ + H)u a iy”ju a j + h - c - • (89.29) 

We can make unitary transformations in generation space on the fields: 
dj ^ Dudj, d 7 — > D u dj, — >■ U u Uj , and u x — > UuUj, where U, D, U 

and D are independent unitary matrices. The kinetic terms are unchanged 
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(except for the couplings to the W±, as we will discuss momentarily), and 
the Yukawa matrices y' and y" are replaced with D T y' D and U T y"U. We 
can choose D, D, U , and U so that D T y' D and U T y"U are diagonal with 
positive real entries y\ and y'{. The down quarks V T then have masses 
rricjj = y^v/V 2, and the up quarks £4 have masses m Ul = y"v/\[2. In 
the neutral currents J 3 and J^ M , we simply add a generation index I to 
each field, and sum over it. The charged currents are more complicated, 
however; they become 


J +M = , (89.30) 

■r fl = , (89.31) 

where V = U^D is the Cabibbo-Kobayashi-Maskawa matrix (or CKM ma- 
trix for short). Note that we did not have this complication in the lepton 
sector, because there we had only one Yukawa term. 

A 3 x 3 unitary matrix has 9 real parameters. However, we are still free 
to make the independent phase rotations V, — ► e* a/ X > 7 and 
as these leave the kinetic and mass terms invariant. These phase changes 
allow us to make the first row and column of Vu real, eliminating 5 of the 
9 parameters. The remaining four can be chosen as 6 \ (the Cabibbo angle), 
62 , 9 3 , and 5, where 


V = 


( Cl 

-sic 2 

V-S1S2 


+S1C3 

cic 2 c 3 - s 2 s^e l5 
ci-s 2 c 3 + c 2 s 3 e lS 


+si«3 \ 

cic 2 s 3 + s 2 c 3 e lS 
cis 2 s 3 - c 2 c 3 e lS J 


(89.32) 


and Cj = cos0j and s t = sin0j. The measured values of these angles are 
si = 0.224, s 2 = 0.041, S 3 = 0.016, and 6 = 40°. Note that the charged 
currents now have some terms with a phase factor e iS , and some without. 
Since the time-reversal operator T is antiunitary (T~ 1 iT = —i), the charged 
currents do not transform in a simple way under time reversal. This implies 
that the charged current terms in C m t are is not time-reversal invariant; 
hence the electroweak interactions violate time-reversal symmetry. Since 
CPT is always a good symmetry, time-reversal violation is equivalent to 
CP violation; 5 is therefore sometimes called the CP violating phase. 

At high energies, we can use our results to compute electroweak con- 
tributions to scattering amplitudes involving quarks. This is because, at 
high energies, the SU(3) coupling g 3 is weak; we can, for example, re- 
liably compute the decay rates of the W ± and Z° into quarks, because 
a 3 (M z ) = gl(M z )/ATT = 0.12 is small enough to make QCD loop correc- 
tions a few-percent effect. 
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To understand low-energy processes such as neutron decay, we must first 
write the currents in terms of hadron fields. We take this up in the next 
section. For now, we simply note that the terms in the charged currents 
that involve only up and down quarks are 

J +M = ciP L y*W L , (89.33) 

J-M = Cl z7 L y*P L , (89.34) 

where c\ is the cosine of the Cabibbo angle. 

Reference Notes 

Electroweak interactions of quarks are discussed in more detail in Cheng & 
Li, Georgi, Peskin & Schroeder, Quigg, and Ramond II. 

Problems 

89.1) Verify the claims made immediately after eqs. (89.4) and (89.7). 

89.2) Compute the rates for the decay processes W + —> ud , Z° — > uu, 
and Z° —> dd. Neglect the quark masses. Express your results in 
GeV. Combine your answers with those of problem 88.6, and sum 
over generations to get the total decay rates for the and Z ° . You 
can neglect the masses of all quarks and leptons except the top quark, 
and take 9-2 = 0% = 0. 

89.3) Show that the Standard Model is anomaly free. Hint: you must 
consider 3-3-3, 2-2-2, 3-3-1, 2-2-1, and 1-1-1 anomalies, where the 
number denotes the gauge group of one of the external gauge fields 
in the triangle diagram. Why do we not need to worry about the 
unlisted combinations? 

89.4) Compute the leading term in the beta function for each of the three 
gauge couplings of the Standard Model. 

89.5) After integrating out the fields, we get an effective interaction 
between the hadron and lepton currents that includes 

£ cff = 2V2Z c C(E^Xer){U^V h ) , (89.35) 

where we have defined C = c\Gy at a renormalization scale // = M w , 
and Zc is a renormalizing factor. This interaction contributes to 
neutron decay; see section 90. In this problem, following the analysis 
of problem 88.7, we will compute the anomalous dimension qc of C 
due to one-loop photon and gluon exchange. 
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a) Use Fierz identities to show that eq. (89.35) can be rewritten as 

C eS = 2\/2Z c C(F i y i P0(Z7 i yW eL ) , (89.36) 

and also as 

£ e ff = -^V2Z c C{V c P^U e )(EP^U c ) . (89.37) 

b) Working in Lorenz gauge and using the results of problem 88.7, 
show that gluon exchange does not make a one-loop contribution to 

Zc- 

c) Show that only a photon connecting the e and u lines makes a 
one- loop contribution to Zc- 

d) Note that £P K U C = e'u\ and compare this with ££ = + 

h.c.. Argue that the photon-exchange contribution to Zc is given by 
the one-loop contribution to Z m in spinor electrodynamics in Lorenz 
gauge, with the replacement (— l)(+l)e 2 — » (— l)(+|)e 2 . 

e) Let 7c(«) = cia + . . . , where a = e 2 /4-7r, and find ci. (This ci 
should not be confused with the cosine of the Cabibbo angle.) 
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90 Electroweak Interactions of Hadrons 

Prerequisite: 83, 89 

Now that we know how quarks couple to the electroweak gauge fields, we 
can use this information to obtain amplitudes for various processes in- 
volving hadrons. We will focus on three of the most important: neutron 
decay, n — > pe~v] charged pion decay, tt~ — > and neutral pion decay, 

7T° — > 77. 

Recall from section 83 the chiral lagrangian for pions and nucleons, 

C = - \fl Tr dfJJ + v 3 Tr (MU + C/ t ) 

+ iNftN - m N N(U^P L + UP r )N 
- \{g A -l)iN^{Ud^P R + U^d^UP n )N , (90.1) 

where U(x) = exp[2iir a (x)T a /f n ], n a is the pion field, N is the nucleon field, 
f n is the pion decay constant, M is the quark mass matrix, v 3 is the value 
of the quark condensate, m n is the nucleon mass, and g A is the axial vector 
coupling. The electroweak gauge group SU(2) x U(l) is a subgroup of the 
SU(2 )l x SU(2 )r x U(l)v flavor group that we have in the limit of zero 
quark mass. It will prove convenient to go through the formal procedure 
of gauging the full flavor group, and only later identifying the electroweak 
subgroup. We therefore define matrix-valued gauge fields Z M (x) and r M (x) 
that transform as 



l/j, — * Ll^lf + iLd^lJ , 

(90.2) 


— > Rr^R* + iRd^R^ . 

(90.3) 

Here L(x ) and R(x) are 2x2 unitary matrices that correspond to a general 
SU(2 )l x SU(2 )r x U(l)y gauge transformation; we restrict the U(l) part 
of the transformation to the vector subgroup by requiring det L = det R 
and TrZ^ = Tr r^. The transformation rules for the pion and nucleon fields 

are 

u — lur) , 

(90.4) 


iV L LiV L , 

(90.5) 


N r ^ RN k , 

(90.6) 


where 1V L = P L N and N n = P R N are the left- and right-handed parts of 
the nucleon field. 

We can make the chiral lagrangian gauge invariant (except for terms 
involving the quark masses) by replacing ordinary derivatives with appro- 
priate covariant derivatives. We determine the covariant derivative of each 
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field by requiring it to transform in the same way as the held itself; for 
example, D^U —> L(D fJ U)R^ . We thus find 


DfJJ = d„U - il„U + iU , (90.7) 

D ^ + irtlp - ir prt , (90.8) 

DfjNj l = (<9 M - ?7 M )iV L , (90.9) 

Df,N R = (d tl - ir M )iV R . (90.10) 


Making the substitution d — ► D in £, we learn how the pions and nucleons 
couple to these gauge fields. 

As in section 83, it is more convenient to work with the nucleon held 
A f, dehned via 


N = (uP L + u^P R )Af , 


(90.11) 


where u 2 = U. Making this transformation, we ultimately find 


C = -\fl Tr {dPtfdpU - iPUdfrf - 

+ + r% - WUrjfl) 

+ v 3 Tr (MU + M t U ] ) + iJJfiN - m N JIU 
+ A T{f + \l+ \f)N - g A JJ{</i + \l~ , (90.12) 


where 


V fj, = \i[u\d^u) 

+ u(d^)\ , 

(90.13) 

= \i[u ] {d^u) 

- u(d^ L u ] )\ , 

(90.14) 

s 

-A 

III 


(90.15) 

= ur^u' . 


(90.16) 

It is now convenient to set 



(3 

bn 

i-".* 

II 

+ bfj, , 

(90.17) 

r — r a T a 

Pbjj, . 

(90.18) 


We have normalized b ^ so that the corresponding charge is baryon number. 
The SU(2) gauge helds of the Standard Model can now be identihed as 


92 A“ = i; , (90.19) 

and the electromagnetic gauge held as 


ey V — ^ + v Il + ■ 


(90.20) 
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Eq. (90.20) follows from reconciling eqs. (90.9) and (90.10) with the re- 
quirement that the electromagnetic covariant derivatives of the proton field 
p = J\f\ and the neutron field n = A /2 be given by (<9 M — ieA^)p and d^n. 

We can now find the hadronic parts of the currents that couple to the 
gauge fields by differentiating C with respect to them, and then setting 
them to zero. We find 




J7 


(dc/di;) 

^ I l=r=0 

\iflTrT a U & “[ft + \-Mv)T a r{l~ 9 ^)uM 

+\Ud^ a - ±£ abc ir b d^n c + \JfT a ^{l-g Alb )M + ... , (90.21) 

m/K ) , n 
^ L=r = 0 

\ifl Tr T a U^U + 

-\f^i T a - ±£ abc Tr b d^Tr c + iATT a 7^(l+ 5A 7 5 )AA + . . . , (90.22) 


jz = ^Mi =r =o 

= A T'fN . 


(90.23) 


In the third lines of eqs. (90.21) and (90.22), we have expanded in inverse 
powers of f n . We can now identify the currents that couple to the physical 
and A M fields as 

J +fM = C1 (J - Ul *) 

= ^ci(/ vr c> At 7r + + Z7r°^7r+) + |ci n 7 ^(l— ^ 75 ) 4 * + • • • , (90.24) 

+ U L 2/i ) 

= -^ci(f n d^TT~ - d^ 1 tt ~ ) + |ciP7 m (1-5a75)« + • • • , (90.25) 

Jz = Jg 4 — S w JeM j (90.26) 

t/' r3/i 

J 3 — 

= |(/ 7 r<9 /4 '7r° + i7r + 9 /i 7T _ ) 

+ iP7 m (!-5a75)p - ira7 M ( 1 -fi'A75)n + • • • , (90.27) 

tM _i_ 1 1 

J EM — 1 I 2^B 

= i-K + d^TT~ + p^f^p + . . . , 


(90.28) 
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where c\ is the cosine of the Cabibbo angle, and the interactions are spec- 
ified by 

Ant = ^92 W+J + -^g 2 W li J +,J ' + Sw e Cw Z,J? + eA^j£ M . (90.29) 

For low-energy processes involving W± or Z° exchange, we can use the 
effective current-current interaction that we derived in section 88, 

As = 2V2 G f (J + % + . (90.30) 

We should include both hadronic and leptonic contributions to the currents. 

Consider charged pion decay, n~ — -> p~V^. The relevant terms in the 
charged currents (neutral currents do not contribute) are 

= H= Cl f n d^TT^ , (90.31) 

J+m = , (90.32) 

where JA is the muon held and M m is the muon neutrino held. The relevant 
term in the effective interaction is then 


As = G F cif n d^i: Al7 M (l-7 5 )A/' m . 

The corresponding decay amplitude is 

T = G F cif n k^ Fi7 m (1— 75)^2 , 


(90.33) 

(90.34) 


where the four-momenta of the pion, muon, and antineutrino are k, p \ , 
and P 2 ■ Eq. (90.34) can be simplihed by using ^ + V 2 a l° n g with 

IZi^i = —ni/jUi and rf 2 v 2 = 0; we get 

T = —G F c\f n m IJ ui(l— r )^)v2 • (90.35) 


We see that T is proportional to the muon mass; since m /t S> m e , decay to 
H~Vn is preferred over decay to e~V e . 

Squaring T and summing over final spins, we find 

{\T\ 2 ) = (G F ci/ 7r m /i ) 2 (-8pi-p 2 ) 

= 4(G F ci/ 7r m M ) 2 (m 2 - ?n 2 ) . (90.36) 

We used -2pi-p 2 = p\ +p 2 — {P1+P2) 2 — ~ m. 2 + 0 + m 2 to get the second 
line. We now have 

r = i/‘ iLIPS2(fc)<|T|2) 


iPil 

8irm% 


(\T\ 2 ) 


GlcjfZmlm* 



47T 


1 


(90.37) 
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where we used |pi| = (m^ — m 2 p )/2m- K to get the last line. Since we deter- 
mine the value of Gp from the decay rate of the muon (see section 88 ), the 
charged pion decay rate allows us to fix the value of cif n . 

The value of c\ can be determined from the rate for the decay pro- 
cess 7 r~ — ► 7 r°e~u e , which we will calculate in problem 90.6. The relevant 
hadronic term in the charged current is 

J ~ M = — -^ci7T°<9 Al 7r - , (90.38) 

which depends on c\ but not f n . Comparison with experiment then yields 
ci = 0.974. We note that the key feature of eq. (90.38) is that it involves 
spin-zero hadrons that are members of an isospin triplet; eq. (90.38) applies 
to any such hadrons, including nuclei. Thus ci can also be measured in 
superallowed Fermi decays , which take a nucleus from one spin-zero state to 
another spin-zero state with the same parity in the same isospin multiplet. 

Having thus determined ci, the charged pion decay rate yields = 
92.4 MeV. 

Next we consider neutron decay, n — ► pe~v e . The relevant terms in the 
charged currents (neutral currents do not contribute) are 


J M = 2 c 1 P7 /i (l-3A75)« , 

(90.39) 

j +fl = i^(l- 75 )A4 , 

(90.40) 


where £ is the electron field and J\f e is the electron neutrino field. The 
relevant term in the effective interaction is then 


£ e ff = ^G F cip7 /i (l- 5 A75)n^7M(l-75)A4 . (90.41) 

Consider a neutron with four- momentum p n = (m n , 0 ), and spin up along 
the z axis; the decay amplitude is 

? = ^G F c 1 [u p ^(l-g A ^ 5 )u n ][u e 'y ll (l-'y 5 )vp\ , (90.42) 

where u n u n = |(1— 7 5 ^)(— ^ n +m n ). We take the absolute square of T and 
sum over the final spins. Since the maximum available kinetic energy is 
m n — m p — m e = 0.782 MeV <C rn p , the proton is nonrelativistic, and we 
can use the approximations p p -p e — —iTipE e and p p pv — —m p Ep in addition 
to the exact formulae p n -p e = —m n E e and p n -pp = —m n Ep. After a tedious 
but straightforward calculation, we find 
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Figure 90.1: One- loop diagrams contributing to 7 r° — > 77. The solid line is 
a proton. 


where the correlation coefficients are given by 

1 -ffj _ 2g A (l - g A ) _ 2g A (l + g A ) 

1 + 3 0 ! ’ 1 + 3 5 i ’ 1 + 3 5 i 


(90.44) 


When we integrate over the final momenta to get the total decay rate, the 
correlation terms vanish, and so the rate is proportional to Gpcf (1 + 3 g A ). 
Since we get the value of Gytf from the rate for 7r _ — > 7r°e _ u e . the neutron 
decay rate allows us to determine 1 + 3 g\. To get the sign of g A , we need 
a measurement of either A or B. (The antineutrino three-momentum can 
be determined from the electron and proton three-momenta.) The result is 
that g A = +1.27. The measured values of the three correlation coefficients 
are all consistent with eq. (90.44). 

Finally, we consider the decay of the neutral pion into two photons, 
7r° ~ 1 > 77- None of the terms in our chiral lagrangian, eq. (90.12), couple 
a single 7r° to two photons. Therefore, without adding more terms, this 
process does not occur at tree level. However, at the one-loop level, we 
have the diagrams of fig. (90.1); a proton circulates in the loop. Let us 
evaluate these diagrams. In section 83, we found that the coupling of the 
7T° to the nucleons is given by 


C ttOnn = f^d^tp^^p - n7^7 5 n) . (90.45) 

This leads to a n°pp vertex factor of \{g A / /7r)fcp7 p 75- The diagrams in 
fig. (90.1) are then identical to the diagrams we evaluated in section 76, 
and so the one-loop decay amplitude is 


i'T'i-ioop = \{9a/ U){ ie) 2 e^£ 2 vkpC^ p {ki,k 2 ,k) , (90.46) 

where 

k p C^(h,k 2 , k ) = . (90.47) 

Here we have chosen to renormalize so as to have ki p C pup (ki, k 2 , k) = 0 and 
k 2u C lwp (k \ , &2, k) = 0; this is required by electromagnetic gauge invariance. 



90: Electroweak Interactions of Hadrons 


552 


Combining eqs. (90.46) and (90.47), we get 

2 

71-loop = - 1 fj- e a ^k la £ lfi k 20 £ 2u . (90.48) 

This result is subject to higher-loop corrections. Note that diagrams with 
extra internal pion lines attached to the nucleon loop are not suppressed by 
any small expansion parameter. Thus we cannot trust the overall coefficient 
in eq. (90.48). 

Note that this amplitude would arise at tree level from an interaction 
of the form o 77 oc ip£ a,l P v F tl(X F v fi. If we integrate out the nucleon fields 
to get an effective lagrangian for the pions and photons alone, such a term 
should appear. 

There is a problem, however. The SU(2 )l x SU(2)r x U(l)y symmetry 
of the effective lagrangian implies that a pion field that has no derivatives 
acting on it must be accompanied by at least one factor of a quark mass. 
For example, we could have C n o 77 oc i r Er{MU—M^U^)£ a ^ v F iia F v p. The 
problem is that there are no quark-mass factors in eq. (90.48). So we have an 
apparent contradiction between our explicit one- loop result, and a general 
argument based on symmetry. 

This contradiction is resolved by noting that the electromagnetic gauge 
field results in an anomaly in the axial current = J A — J'A . In terms 
of the quark doublet 

Q=(^j, (90.49) 

this current is 


•/a" = QT 3 'f'y 5 Q 

= \U^ lh U - \V^ lb V , (90.50) 

where we have suppressed the color indices. Using our results in sections 
76 and 77, the anomalous divergence of this current is given by 


9, A 11 = -^T \(T 6 Q 2 )e^F^F pa 


where 


Q = 


+1 o 

0 -i 


(90.51) 


(90.52) 


is the electric charge matrix acting on the quark fields, and the trace in- 
cludes a factor of three for color; we thus have 

Tr (T 3 Q 2 ) = 3(7(+|) 2 - i(-i) 2 ) = +| , 


(90.53) 
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and so 

d ,rJl 11 = ~^2 . (90.54) 

This formula is exact in the limit of zero quark mass. 

Now using eqs. (90.21) and (90.22), we can write the axial current in 
terms of the pion fields as 

t3 [i t3/j. t3^. 

'J A = — 

= fnd^TT 0 + ... . (90.55) 


(We do not include the nucleon contribution because we are considering 
the effective lagrangian for pions and photons after integrating out the 
nucleons.) From eq. (90.55) we have d p J A ^ = f^d 2 !? 0 + . . . ; Combining this 
with eq. (90.54), we get 


-3V = l ~ ^ Upa F^F pa + 0(/- 2 ) . (90.56) 

This equation of motion would follow from an effective lagrangian that 
included an interaction term of the form 


0 77 = ^e^F pu F pa . (90.57) 

This interaction leads to a 7r° — > 77 decay amplitude of 

e 2 

T = - ^2 r ^ upa ki^£i^k2 p £2a • (90.58) 


This amplitude receives no higher-order corrections in e 2 , but is subject 
to quark-mass corrections; these are suppressed by powers of ?n 2 /(47r/ 7r ) 2 . 
Comparing eq. (90.58) with eq. (90.48), we see the one-loop result (which 
receives unsuppressed corrections) is too large by a factor of g A = 1.27. 

Squaring T, summing over final spins, integrating over dLIPS2(fc), and 
multiplying by a symmetry factor of one half (because there are two iden- 
tical particles in the final state), we ultimately find that the decay rate 
is 


2 3 

am* 
647 r 3 / 2 ' 


(90.59) 


This prediction is in agreement with the experimental result, which has an 
uncertainty of about 7%. 


Reference Notes 


Electoweak interactions of hadrons are treated in Georgi and Ramond, II. 
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Problems 

90.1) Verify that the covariant derivatives in eqs. (90.7-90.10) transform 
appropriately. 

90.2) Verify that substituting eq. (90.11) into eq. (90.1) yields eq. (90.12). 

90.3) Compute the rate for the decay process t~ — ■> -k~u t . Look up the 
measured value and compare with your result. 

90.4) a) Verify eq. (90.43). 

b) Compute the total neutron decay rate. Given the measured neu- 
tron lifetime t = 886 s, and using Gp = 1.166 x 10 -5 GeV -2 and 
ci = 0.974, compute g A . Your answer is about 4% too high, because 
we neglected loop corrections, and the Coulomb interaction between 
the outgoing electron and proton. 

90.5) Use your results from problems 88.7 and 89.5 to show that the neutron 
decay rate is enhanced by a factor of 1 + ln(M w /m p ). How much 
of the 4% discrepancy is accounted for by this effect? 

90.6) Compute the rate for the decay process ir~ —> n 0 e~i' e . Note that, 
since m n + — m n o = 4.594 MeV <C o, the outgoing 7r° is nonrela- 
tivistic. Compare your calculated rate with the measured value of 
0.397 s^ 1 to determine ci. Your answer is about 1% too low, due to 
neglect of loop corrections. 

90.7) Verify eq. (90.59). Express T in eV. 
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91 Neutrino Masses 

Prerequisite: 89 

Recall from sections 88 and 89 that a single generation of quarks and leptons 
consists of left-handed Weyl fields q a i, u a , d a , li, and e in the representa- 
tions (3, 2, +±), (3, 1, -§), (3, 1, +3), (1, 2, -±), and (1, 1, +1) of the gauge 
group SU(3) x SU(2) x U(l). The Higgs field is a complex scalar pi in the 
representation (1,2, —5). The Yukawa couplings among these fields that 
are allowed by the gauge symmetry are 

^Yuk = -ye iJ (pi£je - y' £ lJ p>iq a jd a - y"pt l q ai u a + h.c. . (91.1) 

After the Higgs field acquires its VEV, these three terms give masses to the 
electron, down quark, and up quark, respectively. The neutrino remains 
massless. Thus, massless neutrinos are a prediction of the Standard Model. 

However, there is now good experimental evidence that the three neutri- 
nos actually have small masses. The data implies that mass of the heaviest 
neutrino is in the range from 0.04 eV to 0.5 eV. To account for this, we must 
extend the Standard Model. 

Let us continue to consider a single generation. We introduce a new 
left-handed Weyl field u in the representation (1,1,0); this held does not 
couple to the gauge fields at all, and its kinetic term is simply 
(The bar over the v in the held u is part of the name of the field , and does 
not denote any sort of conjugation.) With this new held, we can introduce 
a new Yukawa coupling of the form 


-C^Yuk = -yp u £in + h.c. . 

(91.2) 

In unitary gauge, this becomes 


-C^Yuk = ~^y( v + H){vv + pV) . 

(91.3) 


We see that the neutrino mass is rh = yv/y/2. 

If this was the end of the story, we would have no understanding of why 
the neutrino mass is so much less than the other hrst-generation quark and 
lepton masses; we would simply have to take y much less than y, ;</, and 

y" ■ 

However, because v is in a real representation of the gauge group, we 
are allowed by the gauge symmetry to add a mass term of the form 

h~-V mass = + NN) . (91.4) 

Here M is an arbitrary mass parameter. In particular, it could be quite 
large. 
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Adding eqs. (91.3) and (91.4), we find a mass matrix of the form 

A/Pmass = (z' ^)^_ jn+h,.. (91.5) 


If we take M 3> fh, then the eigenvalues of this mass matrix are M and 
(The sign of the smaller eigenvalue can be absorbed into the 
phase of the corresponding eigenfield.) Thus, if fh is of the order of the 
electron mass, then fh 2 /M is less than 1 eV if M is greater than 10 3 GeV. 
So y can be of the same order as the other Yukawa couplings, provided M 
is large. This is called the seesaw mechanism for getting small neutrino 
masses. The eigenfield corresponding to the smaller eigenvalue is mostly v, 
and the eigenfield corresponding to the larger eigenvalue is mostly za 

Another way to get this result is to integrate out the heavy D field at 
the beginning of our analysis. We get the leading term (in an expansion in 
inverse powers of M) by ignoring the kinetic energy of the v field, solving 
the equation of motion for it that follows from £p mas s + £j,Yuk> an d finally 
substituting the solution back into £p mass + -C^Yuk- The result is 


C 


z^Yuk+mass 


v_ 2 - 

2M L 


+ h.c. 


= -\m v {w + z/V t )( 1 + H/v) 2 , 


(91.6) 


where 


m u 



y 2 v 2 
2 M ' 


(91.7) 


Again, we can absorb the minus sign in eq. (91.7) by making the field re- 
definition is — > zza 

The seesaw mechanism has a straightforward extension to three gen- 
erations. Let us consider the fields in, e 7 , and is,, where I = 1,2,3 is a 
generation index. The most general Yukawa and mass terms we can write 
down now read 


Tvuk+mass = - ^^UVijVj ~ \MjjVjVj + h.c. , (91.8) 

where y u and y u are complex 3x3 matrices, M u is a complex symmetric 
3x3 matrix, and the generation indices are summed. In unitary gauge, 
this becomes 


Tvuk+mass = - ^75 {v+H )e I y IJ e j - -^(u+i7)z/ / y /J P J - \MuVjVj + h.c. . 

(91.9) 

We can now integrate out the zb fields; eq. (91.9) is then replaced with 
Tyuk+mass = - ^75 {v+H)e I y IJ e J - \ {m v ) ij(viVj+v\vI) { 1+H/v) 2 , (91.10) 
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where we have defined the complex symmetric neutrino mass matrix 

K)u = -^ 2 (y T M _1 y)/j • (9i.ii) 

We can make unitary transformations in generation space on the fields: 
e r — > E u ej , dr — > E u ej , and u, —> N IJ u J , where E, E, and N are inde- 
pendent unitary matrices. The kinetic terms are unchanged (except for the 
couplings to the W^, as we will discuss momentarily), and the matrices y 
and m v are replaced with E T yE and N T m u N. We can choose the unitary 
matrices E, E, and N so that E T yE and N T m. u N are diagonal with posi- 
tive real entries y r and m Ul . The neutrinos Af, then have masses m Ul , and 
the charged leptons S, have masses m ei = r y I v/\/2. In the neutral currents 
Jg and Jg M , we simply add a generation index I to each field, and sum 
over it. The charged currents are more complicated, however; they become 

, (91.12) 

■r 11 = , (91.13) 

where X = N^E is the analog in the lepton sector of the CKM matrix V 
in the quark sector. 

One difference, though, between X and V is that the phases of the 
Majorana A/) fields are fixed by the requirement that the neutrino masses 
are real and positive. Thus we cannot change these phases to make the first 
column of X real, as we did with V . We are allowed to change the phases 
of the Dirac fields, so we can make the first row of X real. Thus X has 
9 — 3 = 6 parameters, two more than the CKM matrix V. 

The presence of X in the charged currents leads to the phenomenon of 
neutrino oscillations. A neutrino that is produced by scattering an electron 
off a target will be a linear combination X u Vj of the neutrinos of definite 
mass. The different mass eigenstates propagate at different speeds, and 
then (in a subsequent scattering) may become (if there is enough energy) 
muons or taus rather than electrons. It is the observation of neutrino 
oscillations that leads us to believe that neutrinos do, in fact, have mass. 

Reference Notes 

Neutrino masses are discussed in detail in Ramond II. 

Problems 

91.1) Show that introducing neutrino masses via the seesaw mechanism 
results in lepton number no longer being conserved. 
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92 Solitons and Monopoles 

Prerequisite: 84 

Consider a real scalar field ip with lagrangian 

C = -^ipd^tp - V(ip) , (92.1) 

with 

V{ip) = \\{ip 2 -v 2 ) 2 . (92.2) 

As we discussed in section 30, this potential yields two ground states or 
vacua, corresponding to the classical field configurations ip{x) = +v and 
<p(x) = —v. After shifting the field by its VEV (either +u or —v), we find 
that the particle mass is m = \ l / 2 v. 

Let us consider this theory in two spacetime dimensions (one space 
dimension x and time t). In this case, tp and v are dimensionless, and A has 
dimensions of mass squared. In the quantum theory, the coupling is weak 
if A <C m 2 . 

The case of one space dimension is interesting for the following reason. 
The boundary of one-dimensional space consists of two points, x = — oo 
and x = Too. This topology of the spatial boundary is mirrored by the 
topology of the space of vacuum field configurations, which also consists 
of two points, ip(x) = — v and <p(x) = +v. In each vacuum, both spatial 
boundary points (x = — oo and x = +oo) are mapped to the same field 
value (either — v or +u). This is a trivial map. More interesting is the 
identity map , where x = — oo is mapped to ip = —v, and x = +oo is 
mapped to ip = +v. This map does not correspond to a vacuum; the field 
must smoothly interpolate between ip = —v at x = — oo and ip = +v at 
x = Too, and this requires energy. The interesting question is whether it 
can be done at the cost of a finite amount of energy. 

To make these notions more precise, we will look for a minimum en- 
ergy, time-independent solution of the classical field equations, with the 
boundary conditions 

lim ipix) = ±v . (92.3) 

X — >zboO 

The total energy is given by 

/ +°o r 

dx + \ip ,2 + V{p>) . (92.4) 

-OO L J 

The solution of interest is time independent, so we can set <fi = 0. We can 
also rewrite the remaining terms in E as 
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f+OO 


2 r+v 

+ / V2V(<p) dip 

J — V 


dx \ [ip - V2 V(ip) ) 

J dx\ (p' - V2V(p ) ) + § ( m 2 /\)m 


(92.5) 


Since the first term in eq. (92.5) is positive, the minimum possible energy is 
M = |(m 2 /A)m; this is much larger than the particle mass m if the theory 
is weakly coupled (A <C m 2 ). Requiring the first term in eq. (92.5) to vanish 
yields tp' = \J 2V (<p), which is easily integrated to get 

<p(x) = vtanh^m(x — xo)) , (92.6) 

where xq is a constant of integration. The energy density is localized near 
x = xq, and goes to zero exponentially fast for \x — xq | > 1 /m. 

This solution is a soliton, a solution of the classical field equations with 
an energy density that is localized in space, and that does not dissipate or 
change its shape with time. In this case (and in all cases of interest to us), 
its existence is related to the topology of the boundary of space and the 
topology of the set of vacua, and the existence of a nontrivial map from the 
boundary of space to the set of vacua. 

Given eq. (92.6), we can get other soliton solutions by making a Lorentz 
boost; these solutions take the form 

<p(x, t) = v tanh^ 7 m(x — xq — /3t)^j , (92.7) 

where 7 = (1 — /3 2 ) -1 / 2 ; their energy is E = 7 M = (p 2 + Af 2 ) 1 / 2 , where 
M = |(m 2 /A)m is the energy of the soliton at rest, and p = 7 (3M is 
the momentum of the soliton, found by integrating the momentum density 
T 01 = ip'd°<p. 

We see that the soliton behaves very much like a particle. We may 
expect that, in the quantum theory, the soliton will correspond to a new 
species of particle with mass M, in addition to the elementary field excita- 
tion with mass m. 

The soliton solution is still interesting if there is more than one spatial 
dimension. In that case, eq. (92.6) describes a domain wall , a structure that 
is localized in one particular spatial direction, but extended in the others. 
The wall has a surface tension (energy per unit transverse area) given by 
a = |m 3 /A. 

Having found a theory that has a soliton that is localized in one spatial 
direction, let us try to find a theory that has a soliton that is localized in 
two spatial directions. In two space dimensions, the spatial boundary has 
the topology of a circle, denoted by the symbol S 1 . There is no smooth 
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nontrivial map from a circle to two points; continuity of the map requires 
the entire circle to be mapped into one of the two points. But there do 
exist smooth nontrivial maps from one circle to another circle, as we will 
discuss momentarily. 

So, we would like to find a theory whose vacua have the topology of 
a circle. To this end, let us consider a complex scalar field tp(x), with 
lagrangian 

C = -d^^d^ip - V(ip) , (92.8) 

where 

V(<p) = k A(v>V-u 2 ) 2 . (92.9) 

The vacuum field configurations are 

<p(x) = ve ia , (92.10) 

where a is an arbitrary angle. This angle specifies a point on a circle, and 
so the space of vacua does indeed have the topology of S 1 . 

Let us write x = r(cos0, sin</>); then the angle (j) specifies a point on 
the spatial circle at infinity. We can specify a map from the spatial circle 
to the vacuum circle by giving a as a function of <f>. In order for ip(x) to be 
single valued, this function must obey a{4> + 2i r) = a{4>) + 2im, where the 
integer n is the winding number of the map: we wind around the vacuum 
circle n times for every one time that we wind around the spatial circle. 
(If n is negative, the vacuum winding is opposite in direction to the spatial 
winding.) An example of a map with winding number n is U (cj>) = e m< ^. 
Setting n = 0 then yields the trival map, n = 1 the identity map, and 
n = — 1 the inverse of the identity map. 

Given a smooth map its winding number can be written as 

n = — d<\> Ud+rf , (92.11) 

27t Jo 

where U' is the complex conjugate of U. To verify that eq. (92.11) agrees 
with our previous definition, we first check that plugging in our example 
map indeed yields the correct value of the winding number. We then show 
that the right-hand side of eq. (92.11) is invariant under smooth deforma- 
tions of U (</>)', see problem 92.2. Thus any U {<p) that can be smoothly 
deformed to e * n< Jp has winding number n. 

Next, we want to look for a finite-energy solution of the classical field 
equations for the theory specified by eqs. (92.8) and (92.9), with the bound- 
ary condition 


lim c p(r,q !>) = vU{4>) 


(92.12) 
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with U{4>) corresponding to a map with nonzero winding number. We 
therefore make the ansatz 


<p( r i <t>) = vf(r)e m,p , 


(92.13) 


where f(r) is a real function that obeys /( oo) = 1. We must also have 
/( 0) = 0 so that V<£>(r, (j)) is well defined at r = 0. 

Alas, it is easy to see that there is no finite-energy solution of this form. 
The gradient of the field is 


V<p = vyf'{r)r + mr _ 1 /(r)</>j e m ^ 
and the gradient energy density is 

|V(^| 2 = v 2 \f'(r) 2 + n 2 r~ 2 f(r) 2 


(92.14) 


(92.15) 


At large r, f(r) must approach one; then the integral over the second term 
in eq. (92.15) diverges logarithmically, 

j d 2 x | V</3| 2 ~ 2irn 2 v 2 J — . (92.16) 

So the energy is infinite. This is, in fact, a very general result, known as 
Derrick’s theorem: with scalar fields only, there are no finite-energy, time- 
independent solitons that are localized in more than one dimension. The 
problem is that the gradient energy always diverges at large distances from 
the putative soliton’s core. 

To get solitons that are localized in more than one dimension, we must 
introduce gauge fields. Note that the lagrangian of eq. (92.8) has a global 
U(l) symmetry. Let us gauge this U(l) symmetry, so that the lagrangian 
becomes 

C = -(D^D^ - V(<p) - \F^ W , (92.17) 

where 

- ieAjjip , (92.18) 

and V ((p) still given by eq. (92.9). The gauge symmetry is therefore sponta- 
neously broken, and the mass of the vector particle is rn v = ev. The mass 
of the scalar particle is rn s = X l / 2 v. 

The gradient energy density of the scalar field is now 

\D^\ 2 = \(V - ieA)p\ 2 . (92.19) 


Thus we have the opportunity to choose A so as to partially cancel the 
badly behaved second term in eq. (92.14). To see how to do this, recall that 
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a gauge transformation in this theory takes the form 

p -»• Up, (92.20) 

A„ - , (92.21) 

where U is a 1 x 1 unitary matrix that is a function of spacetime. As r —> oo, 
our ansatz for p, eq. (92.13), corresponds to a gauge transformation of a 
vacuum, p = v, by U = e in< ^. The corresponding transformation of A ^ = 0 
is 


lim A(r, (j>) = iuvrf 

r— > oo c 

71 /v 

= —6. (92.22) 

er 

Before making the gauge transformation, we have p = v and A M = 0, and 
so D^p = 0; by gauge invariance, this must be true after the transformation 
as well. Indeed, it is easy to check that, with A given by eq. (92.22), we 
have (V — ieA)ve 1 ^ = 0. 

For n / 0, the gauge transformation U = e in< ^ is large: it cannot be 
smoothly deformed to U = 1. This implies that we cannot extend it from 
r = oo into the interior of space without meeting an obstruction, a point 
where U (r, 4>) is ill defined. For example, the simplest attempt at such an 
extension, U(r,(j)) = e im is ill defined at r = 0. Near the obstruction, 
the fields p and A must deviate from a gauge transformation of a vacuum. 
This deviation costs energy, and results in a soliton. 

Our ansatz for a soliton in the theory specified by eq. (92.17) is then 

p{r,4>) = vf(r)U(<t>) , (92.23) 

A (r,<j>) = | a(r)U(</>)VUt(<l>) , (92.24) 

where U{(j>) = e m< ^, and we require /(oo) = a(oo) = 1 (so that the solution 
approaches a large gauge transformation of a vacuum as r — > oo) and 
/( 0) = o(0) = 0 (so that A and Vp are well defined at r = 0). For n = 1, 
this soliton is a Niels en-Olesen vortex. 

The nonzero vector potential results in a perpendicular magnetic field 

B = V x A 



e r 


(92.25) 
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The corresponding magnetic flux is 
$ = j dS • B 


= lim d£ ■ A 

r— xx) J 

i /‘27T 

= - lim a(r) / df UdJJ ^ 
e Jo 

2irn 


e 


(92.26) 


Here the second line follows from Stokes’ theorem, the third from eq. (92.24), 
and the fourth from eq. (92.11). 

The energy of the soliton is 


E = J d 2 x [|(V - ieA)ip\ 2 + V(tp) + ±B 2 ] . (92.27) 


Substituting in our ansatz, eqs. (92.23) and (92.24), we get 

E = 2nv 2 f dr r / /2 + ^-(a-l) 2 / 2 + |Au 2 (/ 2 -l) 2 + ™ a' 2 . (92.28) 

Jo L r e dt 

It is convenient to define a dimensionless radial coordinate p = evr = m v r. 
Let us also define /3 2 = A/e 2 = m 2 /mf. Then eq. (92.28) becomes 

E = 2 irv 2 ^ dp p\f 12 + ^(a - iff 2 + i/J 2 (/ 2 - l) 2 + ^a' 2 1 , (92.29) 

Jo L p p i 

where a prime now denotes a derivative with respect to p. We can find the 
equations obeyed by f(p) and a(p) either by substituting the ansatz into 
the equations of motion, or by applying the variational principle directly 
to eq. (92.29). Either way, the result is 

f" + -~ ^£(1 - af + \(5 2 { 1 - f 2 )f = 0 , (92.30) 

p p 2 

a” - — + (1 - a)f 2 = 0 , (92.31) 

P 

with the boundary conditions a(0) = /( 0) = 0 and a(oo) = fioo) = 1. 

Eqs. (92.30) and (92.31) have no closed-form solution. However, for 
p <C 1, we can show that a(p) ~ p 2 and f(p) ~ P n \ and for p 3> 1, that 
1 — a(p) ~ e~ p and 1 — f(p) ~ e~ cp , where c = min(/3, 2); see problem 
92.4. For n and /3 of order one, the integral in eq. (92.29) also results in a 
number of order one, and so we have E ~ 2irv 2 . For (3 > 1, it is possible 
to prove a Bogomolny bound , E > 2irv 2 \n\. In this case, a soliton with 
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winding number n is unstable against breaking up into \n\ solitons, each 
with winding number one (or minus one, if n is negative). 

Once we have our soliton solution, we can translate and/or boost it; thus 
we expect the soliton to behave like a particle in two space dimensions. In 
three space dimensions, the soliton becomes a Nielsen- Olesen string (also 
called a gauge string ), a structure that is localized in two directions, but 
extended in the third. Such strings can bend, and even form closed loops. 
In certain unified theories (see section 97), gauge strings may have formed 
in the early universe; they are then called cosmic strings. 

Now let us try to find a soliton that is localized in three spatial direc- 
tions. In three space dimensions, the spatial boundary has the topology of 
a two-dimensional sphere S 2 . There are smooth nontrivial maps from S 2 to 
S 2 , as we will discuss momentarily, so let us look for a theory whose vacua 
have the topology of S 2 . 

Consider three real scalar fields ip a , a = 1, 2, 3, with lagrangian 


c = -1 &*p a d^ a - V{p) , 

(92.32) 

where 


V{p) = \\ (p a p a -v 2 ) 2 . 

(92.33) 

The vacuum field configurations are 


p a {x) = vp a , 

(92.34) 


where p is an arbitrary unit vector. This unit vector specifies a point on a 
two-sphere, and so the space of vacua does indeed have the topology of S 2 . 

Let us write x = ?’(sin0cos</, sin 0 sin ^>, cos0); then the polar and az- 
imuthal angles 9 and 0 specify a point on the spatial two-sphere at infinity. 
We can specify a map from the spatial two-sphere to the vacuum two- 
sphere by giving (p as an (appropriately periodic) function of 9 and 0. We 
can define a winding number n that counts the number of times the vacuum 
two-sphere covers the spatial two-sphere, with n negative if the orientation 
is reversed. An example of a map with winding number n can be con- 
structed by taking the polar angle of tp to be 9, and the azimuthal angle to 
be ncj). Setting n = 1 then yields the identity map, and n = — 1 the inverse 
of the identity map. 

Given a smooth map <p a {9, </>), its winding number can be written as 

n = — [ d 2 9 £ abc £ ij <p a dip b dj<p c , (92.35) 

87t J 

where d?9 = d9 d(j), d\ = d/d9, 82 = d/d(j>, and e 12 = — e 21 = +1. To 
verify that eq. (92.35) agrees with our previous definition, we first check 
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that plugging in our example map indeed yields the correct value of the 
winding number; see problem 92.5. We then show that the right-hand 
side of eq. (92.35) is invariant under smooth deformations of ip a (9,(f>)] see 
problem 92.6. It is also worthwhile to note that the right-hand side of 
eq. (92.35) is invariant under a change of coordinates, because the jacobian 
for d 2 9 is cancelled by the jacobian for di& 2 - This is of course closely related 
to the invariance under smooth deformations, since one way to make such 
a deformation is via a coordinate change. 

Next, we want to look for a finite-energy solution of the classical field 
equations with nonzero winding number, but we already know that these 
will not exist unless we introduce gauge fields. We therefore take (p a to be 
in the adjoint representation of an SU(2) gauge group. The lagrangian is 


now 


£ = -\{D^Y(D,<p) a - V(<p) - \f^f; v , 

(92.36) 

where 


(D^y = d,<p a + ee abc Ay, 

(92.37) 


F% = d ll Al-d v Al + ee abc A b ll Al, 

(92.38) 


and V(:p) is given by eq. (92.33). We have called the gauge coupling e for 
reasons that will become clear in a moment. 

The gauge symmetry is spontaneously broken to U(l). If we take the 
vacuum field configuration to be (p a = v5 a 3 , then the A ^ field remains 
massless; we will think of it as the electromagnetic field. The complex 
vector fields W± = (A* iA 2 t )/y/ 2 get a mass m w = ev, and have electric 

charge ±e. (This is the reason for calling the gauge coupling e.) This 
theory, known as the Georgi- Glashow model , was once considered as an 
alternative to the Standard Model of electroweak interactions (but is now 
ruled out, because it does not have a Z° boson). 

When the vacuum field configuration is (p a = v8 ai , the electromagnetic 
field strength is Tjj v = — d u A^. We can write down a gauge-invariant 

expression that reduces to F IW when we set <p a = v5 a 3 ; this expression is 

F, iV = <p a F; v - e~ 1 £ abc (p a (D fJ- <p) b (D u (p) c . (92.39) 

Here ip a = <^ a /|<^|, where \g>\ = ((p a g> a ) 1 / 2 . We can, in fact, use eq. (92.39) as 
the definition of the electromagnetic field strength at any spacetime point 
where \<p\ ^ 0. (If \tp\ =0, the SU(2) symmetry is unbroken, and there is no 
gauge-invariant way to pick out a particular component of the nonabelian 
field strength F“„.) If we substitute in eqs. (92.37) and (92.38), and make 
repeated use of (p a (p a = 1 and the identity £ abc £ ade = S bd 5 ce — 5 be 8 cd , it is 
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possible to rewrite eq. (92.39) as 

F/iv = d^Al) - d„(f> a A“) - e- l s abc v a d^ b d„v c • (92.40) 

In particular, the magnetic field is 
B' = \e ijk F jk 

= £ ijk dj^ a At) - (2 e)- 1 £ iik e ahc (p a d j (p h d k (p c . (92.41) 


Let us consider the magnetic flux through a sphere at spatial infinity; 
this is given by $ = / dS • B , where dS k = r 2 sin 9 d.0 d(j> x k , and x = x/r 
is a radially outward unit vector. The first term in eq. (92.41) for B is 
V x (<^> a A a ); since this is a curl, it has zero divergence, and therefore zero 
surface integral. From eq. (92.35), we see that the second term in eq. (92.41) 
results in 


= — 


47TO 


(92.42) 


This flux implies that any soliton with nonzero winding number is a mag- 
netic monopole with magnetic charge Qm = 4>. (In Heaviside-Lorentz units, 
the Coulomb field of an electric point charge Qe is E t = Q^Xi/Anr 2 , and 
so the total electric flux is Q e- We adopt the same convention for magnetic 
charge.) 

If we add a field in the fundamental representation of SU(2), then the 
component fields have electric charges ±|e. This is the smallest electric 
charge we can get, and all possible electric charges are integer multiples 
of it. Eq. (92.42) tells us that all possible magnetic charges are integer 
multiples of Air/e. Thus the possible electric and magnetic charges obey 
the Dirac charge quantization condition , which is 


QeQm = 2irk , 


(92.43) 


where k is an integer. This condition can be derived from general consid- 
erations of the quantum properties of monopoles. 

Now let us turn to the explicit construction of a soliton solution. This 
simplest case to consider is provided by the identity map (which has winding 
number n = 1); the soliton we will find is the ’tHoo ft- Polyakov monopole. 

The boundary condition on the scalar field is 


lim <p a (x.) = vx a /r . (92.44) 

r — xx) 

We can find the appropriate boundary condition on the gauge field by 
requiring (D^ip) 0, = 0 in the limit of large r. This condition yields 

di(x a /r) + ee abc A b i x c /r = 0 . 


(92.45) 
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We have di(x a /r ) = ( r 2 S a i — x a Xi)/r 3 . Next we multiply by rxj£^ da , and 
use the identity el da s abc = Sl b 5 dc — S dC S db to get 

£ dij xj + e(x d XjA{ - r 2 A d ) = 0 . (92.46) 

If we ingore the first term in the parentheses, we find Af = £ di ^Xj/er 2 . But 
then XdA d = 0, and so the first term in the parentheses vanishes. Thus we 
have found the needed asymptotic behavior of Af. Our ansatz is therefore 

9 ? a (x) = vf(r)x a /r , (92.47) 

Af(pc) = a(r)£ ai ^ x j / er 2 . (92.48) 

We require /( oo) = a(oo) = 1 (so that Af and (p a have the desired asymp- 
totic limits) and /( 0) = a(0) = 0 (so that Af and (p a are well defined at 
r = 0). 

The total energy of the soliton (which we will call M, because it is the 
mass of the monopole) is given by 


M= drx 


\BfBf + \{D^) a {D^) a + V{ip) 


(92.49) 


The nonabelian magnetic held is 

B? = \e ijk F% 

— tijkdjAf. + \ e£ijk£ abc A b Af. . (92.50) 

If we write eq. (92.48) as Af = Kj, then after some manipulation we 
find that eq. (92.50) becomes Bf = d a Ki — 5 a idjK d + K a Ki. Plugging in 
K, = a(r)xi/er 2 then yields 


o' ( 

f c * * \ 

[ 7 < 

Oai XaXi j 


B? = -~ 
e 


The magnetic held energy then becomes 

1 


2 a — a 2 „ „ 

2 3'0,3'i 


1 iDa Tja 

2 * * “ 2 eV L 


2 r 2 a' 2 + ( 2 a - a 2 ) 2 


The covariant derivative of the scalar held is 

'(! - a )f 


(Dw) a = v 


($ai ~ X a Xi^j + f X a Xi 


The scalar gradient energy density then becomes 

„2 „ 


i(A¥>) a (A¥>)° = ^2 i 2 (! - °) 2 / 2 + r ' 2 f r 


(92.51) 


(92.52) 


(92.53) 


(92.54) 
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The scalar potential energy density is 

V(<p) = v 4 (f 2 - l) 2 . (92.55) 

We can plug eqs. (92.52), (92.54), and (92.55) into eq. (92.49), and then 
use the variational principle to get the second-order differential equations 
obeyed by f(r) and a(r). 

We can get a lower bound on M by performing a trick analogous to the 
one we used in eq. (92.5). We write 

+ i(A¥>)°(A¥>)° = \[Bi + {DwYf ~ B i( D i<p) a . (92.56) 

We can apply the distribution rule for covariant derivatives (see problem 
70.5) to rewrite the last term as B^(Di^) a = di(B?<p a ) — (DiBi) a <p a . Then 
we note that the Bianchi identity (see problem 70.6) implies ( DiBi) a = 0. 
Thus B?(Di(p) a = di(Bf:p a ), and this is a total divergence. Then Gauss’s 
theorem yields 

J d 3 xdi{B?(p a ) = J dSi B?ip a , (92.57) 

where the integral is over the surface at spatial infinity. On this surface, 
we have (p a = vx a . 

Next we use eq. (92.39). At spatial infinity, the covariant derivatives 
of (p vanish; thus we have B^p> a = vBi, where Bi is the magnetic field of 
electromagnetism. We can now see that the right-hand side of eq. (92.57) 
evaluates to where <f> = Qm = — Airn/e is the magnetic charge of the 
monopole. 

In our case, n = 1 and Qm is negative; thus the last term in eq. (92.56) 
integrates to u|<5m|- (For the case of positive Qm, we can swap the plus 
and minus signs in eq. (92.56) to get the same result.) Thus the mass of 
the monopole, eq. (92.49), can be written as 

M = + J d 3 x \[Bf + (sign n){Dnp) a } 2 + V{tp) , (92.58) 

Both terms in the integrand of eq. (92.58) are positive, and so we have a 
Bogomolny bound on the mass of the monopole. For A > 0, a monopole 
with winding number n is unstable against breaking up into |n| monopoles, 
each with winding number one (or minus one, if n is negative). 

Using m w = ev and a = e 2 /47r, we can write the Bogomolny bound as 

M > —Ini . (92.59) 

a 

Since aCl, the monopole is much heavier than the W boson. 

Alas, the Georgi-Glashow model, which has monopole solutions, is not 
in accord with nature, while the Standard Model, which is in accord with 
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nature, does not have monopole solutions. This is because, in the Standard 
Model, electric charge is a linear combination of an SU(2) generator and 
the U(l) hypercharge generator. Nothing prevents us from introducing 
an SU(2) singlet field with an arbitrarily small hypercharge. Such a field 
would have an arbitrarily small electric charge (in units of e), and then 
the Dirac charge quantization condition would preclude the existence of 
magnetic monopoles. 

This disappointing situation is remedied in unified theories (see section 
97), where the gauge group has a single nonabelian factor like SU(5). In 
unified theories, the monopole mass is of order m x /a, where m x is the mass 
of a superheavy vector boson; typically m x 10 15 GeV. 

Returning to the Georgi- Glashow model, we can saturate the Bogo- 
molny bound if we consider the formal limit of A — ■> 0; then V(ip) vanishes. 
(This limit is formal because we need a nonzero potential to fix the magni- 
tude of ip at infinity.) Then we saturate the bound if Bf = —(sign n){Diip) a . 
In the case of the ’t Hooft-Polyakov monopole, we have n = 1, and Bf 
and ( Ditp) a are given by eqs. (92.51) and (92.53). Matching the coeffi- 
cients of 5 a i — x a Xi and x a Xi yields a pair of first-order differential equa- 
tions. These look nicer if we introduce the dimensionless radial coordinate 
p = evr = m w r\ then we find 

a = (1 - a)f , (92.60) 

f = (2 a - a 2 )/ p 2 , (92.61) 


where a prime now denotes a derivative with respect to p. These equations 
have a closed-form solution, 


a(p) = 1 - 


P 

sinh p 


f{p) 


coth p 


1 

P ' 


(92.62) 

(92.63) 


This is the Bogomolny-Prasad-Sommerfeld (or BPS for short) solution. 
A soliton that saturates a Bogomolny bound is generically called a BPS 
soliton. 


Reference Notes 

Discussions of solitons, and their relation to the theory of homotopy groups, 
can be found in Coleman and Weinberg II. 


Problems 
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92.1) Derrick’s theorem says that, in a theory with scalar fields only, there 
are no solitons localized in more than one dimension. To prove this, 
consider a theory in D space dimensions with a set of real scalar fields 
(fi\ any complex scalar fields are written as a pair of real ones. The 
lagrangian is C = — ^d^pid^pi — V(pi), with V(pi) > 0. Suppose 
we have a soliton solution </?;(x); its energy is E = T + U, where 
T = i / d D x (Vpi ) 2 and U = f d D x V(pi). 

a) Now consider ^(x/a), where a is a positive real number. Show 
that, for this held configuration, the energy is E(a) = a D ~ 2 T + a D JJ . 

b) Argue that we must have E'( 1) = 0. 

c) Use this to prove the theorem. 

92.2) The winding number n for a map from S 1 — » S 1 is given by eq. (92.11), 
where U'U = 1. We will prove that n is invariant under an infinites- 
imal deformation of U. Since any smooth deformation can be made 
by compounding infinitesimal ones, this will prove that n is invariant 
under any smooth deformation. 

a) Consider an infinitesimal deformation of U, U — » U + dU. Show 
that SU* = -U^ 2 dU. 

b) Use this result to show that diUd^U^) = —d^U^dU). 

c) Use this to show that dn = 0. 

92.3) Show that if U n ((j)) and Uk(4>) are maps from S 1 — * S 1 with winding 
numbers n and k. then U n ((f>)Uk((t>) is a map with winding number 
n + k. Hint: consider smoothly deforming U n (<j)) to equal one for 
0 < (f> < tt. How should Uk(4>) be deformed? 

92.4) Verify the statements made about the solutions to eqs. (92.30) and 
(92.31) in the limit of large and small p. 

92.5) Use eq. (92.35) to compute the winding number for the map specified 
by ip = (sin 6 cos nq i>, sin 6 sin ruj), cos 6) . 

92.6) The winding number n for a map from S 2 — ► S 2 is given by eq. (92.35), 
where ip a (p a = 1. We will prove that n is invariant under an infinites- 
imal deformation of (p. Since any smooth deformation can be made 
by compounding infinitesimal ones, this will prove that n is invariant 
under any smooth deformation. 

a) Consider an infinitesimal deformation of <p, tp — > <p + dip. Show 
that <p-5<p = 0 and that (p-dip = 0. 

b) Use these results to show that s abc dp a dip h djp c = 0. 

c) Use this to show that dn = 0. 
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93 Instantons and Theta Vacua 

Prerequisite: 92 

Consider SU(2) gauge theory, with gauge fields only. The classical field con- 
figuration corresponding to the ground state is = 0. This implies that 
the vector potential A“ is a gauge transformation of zero, A ^ = A^T 11 = 

Let us restrict our attention to gauge transformations that are time 
independent, U = U(x). This fixes temporal gauge, Aq = 0. We will also 
impose the boundary condition that U (x) approaches a particular constant 
matrix as |x| — ► oo, independent of direction. This is equivalent to adding 
a spatial “point at infinity” where U has a definite value; space then has 
the topology of a three-dimensional sphere S 3 . 

Can every [/(x) be smoothly deformed into every other £/(x)? If the 
answer is yes, then all these field configurations are gauge equivalent, and 
they correspond to a single quantum vacuum state. If the answer is no, then 
there must be more than one quantum vacuum state. To see why, suppose 
that t/(x) and U (x) cannot be smoothly deformed into each other. The 
associated vector potentials, A /t = | Ud^U 1 and A M = ^Ud^W, are both 
gauge transformations of zero, and so both F fJV and F fW vanish. However, if 
we try to smoothly deform A M into A^, we must pass through vector poten- 
tials that are not gauge transformations of zero, and whose field strengths 
therefore do not vanish. These nonzero field strengths imply nonzero en- 
ergy: there is an energy barrier between the field configurations A M and 
A Therefore, they represent two different minima of the hamiltonian in 
the space of classical field configurations. Different minima in the space of 
classical field configurations correspond to different vacuum states in the 
quantum theory. 

It turns out that every U (x) can not be smoothly deformed into every 
other I/(x); the field configurations specified by I/(x) are classified by a 
winding number. To see this, we first note that any 2x2 special unitary 
matrix U can be written in the form 

U = 04 + ia-a , (93.1) 

where 04 and the three-vector a are real, and 

a 2 + a\ = 1 ; (93.2) 

see problem 93.1. Thus a M = [a, 04 ) specifies a euclidean four-vector of unit 
length, = 1, and hence a point on a three-sphere. We will call this 

the vacuum three-sphere. Since our boundary conditions give space the 
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topology of a three-sphere, U(x) provides a map from the spatial three- 
sphere to the vacuum three-sphere. We can define a winding number n 
that counts the number of times the vacuum three-sphere covers the spatial 
three-sphere, with n negative if the orientation is reversed. 

It is convenient to specify the spatial three-sphere by a euclidean four- 
vector Zfj, = (z,Z 4 ) of unit length, z^z^ = 1. An explicit relation between 
z^ and x can be constructed by (for example) stereographic projection: 
we take z = z/\z\ = x, and \z\ = 2?’/(l+r 2 ), Z 4 = (1— r 2 )/(l+r 2 ), where 
r = |x|. Then we can construct an example of a map from the spatial S 3 
to the vacuum S 3 with winding number n by taking the two polar angles 
of (if, to be equal to the two polar angles of z^, and the azimuthal angle of 
a n to be equal to n times the azimuthal angle of z tl . (The polar angles run 
from 0 to 7r, and the azimuthal angle from 0 to 2n.) 

Given a smooth map t/(x), its winding number can be written as 

n = ^ J cfxe^TrKUdiU^iUdjU^iUdkU^)} . (93.3) 

Here we have used the original x coordinates, but we could also use the 
angles that specify z^: the integral in eq. (93.3) is invariant under a change 
of coordinates, because the jacobian for d 3 x is cancelled by the jacobian 
for 818283 . To verify that eq. (93.3) agrees with our previous definition, we 
first check that plugging in our example map indeed yields the correct value 
of the winding number; see problem 93.5. We then show that the right- 
hand side of eq. (93.3) is invariant under smooth deformations of t/(x); see 
problem 93.3. 

So, we have concluded that SU(2) gauge theory has an infinite number 
of classical field configurations of zero energy, distinguished by an integer 
n, and separated by energy barriers. This is analogous to a scalar field 
theory with a potential 

V(ip) = Au 4 [1 — cos(27 up/v)] . (93.4) 

This potential has minima at (p = nv, where n is an integer. Let |n) be 
the quantum state corresponding to the minimum at ip = nv. Generically, 
between two quantum states | n) and \n') that are separated by an energy 
barrier, there is a tunneling amplitude of the form 

(n'\H\n) ~ e~ s , (93.5) 

where H is the hamltonian, and S is the euclidean action for a classical 
solution of the euclidean field equations that mediates between the field 
configuration corresponding to n at t = — 00 , and the field configuration 
corresponding to n' at t = + 00 . In the scalar field theory, this solution 
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is independent of x. Thus, S scales like the volume of space V. and so 
(n'\H\n) vanishes in the infinite volume limit. The minima of eq. (93.4) 
therefore remain exactly degenerate in the quantum theory. 

Things are different in the SU(2) gauge theory. In this case, there is 
a classical solution of the euclidean field equations that mediates between 
states with winding numbers n and n! , and that has an action that stays 
fixed and finite in the infinite- volume limit. The value of this action is 
S = \n'—n\Si, where S\ = 8n 2 /g 2 , and g is the Yang-Mills coupling con- 
stant. For n' = n + 1, this solution is the instanton. The instanton is 
localized in all four euclidean directions. For n' = n — 1, the solution is 
the antiinstanton. For \n' — n\ > 1, the solution is a dilute gas of \n' — n| 
instantons (or antiinstantons, if n' — n is negative) distributed throughout 
euclidean spacetime. 

We will shortly construct the instanton and examine its properties, but 
first we study the consequences of its existence. For SU(2) gauge theory, 
eq. (93.5) reads 

(n'\H\n) ~ e - |n '-" 151 . (93.6) 

These matrix elements depend only on n' — n, and so H can be diagonalized 
by theta vacua of the form 


+oo 

\0) = E e ~ in » ; (93.7) 

n =— oo 

see problem 93.2. For weak coupling, Si 3> 1, and so we can neglect all 
matrix elements of H except those with n' = n ± 1. Then we find that the 
energy of a theta vacuum is proportional to — cos 0. (We are of course free 
to add a constant to H so that the lowest lying state, the theta vacuum 
with 9 = 0, has energy zero.) 

We have derived these results in the weak-coupling regime. However, 
we are discussing properties of low-energy states, and the gauge coupling 
becomes large at low energies. Therefore we must consider the theory to 
be in the strong-coupling regime. How does this affect our conclusions? 

The topological properties of the gauge fields are independent of the 
value of the coupling constant, so we still expect vacuum states labeled by 
the winding number n to exist. We also expect that {n'\H\n) will depend 
only on | n' — n\. To see this, consider making a gauge transformation by 
t/fc(x), where t/fc(x) has winding number k. The product of two maps 
with winding numbers n and A; is a map with winding number n + k\ see 
problem 93.4. Thus, making a gauge transformation by Uk(x) converts a 
field configuration with winding number n to one with winding number 
n + k. In the quantum theory, the gauge transformation is implemented by 
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a unitary operator U k , and we should have 

Uk\n) = | n+k) . (93.8) 

On the other hand, the hamiltonian, which is built out of field strengths, 
must be invariant under time-independent gauge transformations: 

U k HU ] k = H . (93.9) 

Inserting factors of I = u\u k on either side of H in (n'\H\n), and using 
eqs. (93.8) and (93.9), we find 

(n'\H\n) = (n'+k\H\n+k) . (93.10) 

We conclude that (n'\H\n) depends only on n ' — n. We can also note that 
winding number is reversed by parity, P\n ) = | — n), and that the Yang- 
Mills hamiltonian is parity invariant, PHP = P[, to conclude similarly 
that (n'\H\n) = (—n'\P[\—n). Thus (n'\H\n) depends only on \n' — n |. 

The fact that (n'\H\n) depends only on | n' — n| tells us that the theta 
vacua are still eigenstates of H. Furthermore, their energies must be a 
periodic, even function of 9. Of course, the eigenvalues of H should scale 
with the volume of space V. Then, on dimensional grounds, we have 

H\9) = VA 4 QCO f(9)\9), (93.11) 

where A QCD is the scale where the gauge coupling becomes strong. The 
function f{9) must obey f(9 + 2n) = f(9) and f{—9) = f{0). We expect 
the minimum of f(9) to be at 9 = 0. 

We turn now to the solutions of the euclidean field equations. At eu- 
clidean time X 4 = —T, we set A^(x) = ^l/_(x)<9 Al t/l(x), where t/_(x) 
has winding number n_. Similarly, at euclidean time X 4 = +T, we set 
A m (x) = ^U + (’x)d^ul r ('x), where U + (x.) has winding number n + . At 
|x| = R, for —T < X 4 < T, we set the boundary condition A ^ = 0. This 
is equivalent to A fl = with = 0; we therefore set 17(x) to a 

constant matrix at |x| = R. We want to take T and R to infinity at the 
end of the calculation. 

We have now specified t/(x, x 4 ) on the cylindrical boundary of four- 
dimensional spacetime shown in fig. (93.1). This boundary is topologically 
a three-sphere. The winding number of the map on this three-sphere is 
n + — n_. We see this by using eq. (93.3), and noting that the cylindrical wall 
makes no contribution (because = 0 there), the upper cap contributes 
n + , and the lower cap contributes — n_; the sign is negative because the 
orientation of the cap as part of the boundary is reversed from its original 
orientation. 
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Figure 93.1: The boundary in euclidean spacetime. We have a field con- 
figuration with winding number ro_ on the cap at X 4 = —T, and one with 
winding number n + on the cap at X 4 = +T. On the cylindrical surface at 
|x| = R, the field vanishes. 

Since we are interested in large R and T, and since the shape of the 
boundary should not matter in this limit, we instead consider the boundary 
to be a three-sphere at p = (x fl x fl ) 1//2 = 00 . On this boundary, we have a 
map U(x), where this map has winding number n = n + — n_. 

Our first task will be to construct a Bogomolny bound on the euclidean 
action 

S = l f d 4 xTr(Fi IM F lw ) (93.12) 

of a field that obeys the boundary condition 

lim A u (x ) = jjU(x)dnUUx) , (93.13) 

p — >00 r y 

where U(x) is a map with winding number n. The field strength is given 
in terms of the vector potential by 

F IW = d^A v - d v A tl - ig[An, A v \ . (93.14) 

We begin by defining the polar angles x an d t/j, and the azimuthal angle 
(f>, via 

x^ = (sin x sin i/j cos <f>, sin y; sin 'ip sin cp, sin x cos 'ip, cosx) • (93.15) 

(We use V’ rather than 6 in order to avoid any possible confusion with the 
vaccurn angle.) Next we write the winding number in terms of these angles, 

1 r 7T r 7T r 2n 

24^2 l dx l (hlj l (kt> Tr[(Ud a rf)(Udptf)(Ud^)} , (93.16) 


n = 
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where a, (3 , 7 run over x, *0, 0; <9^ = d/d (f>, etc.; and e x ^ = +1. Now 
we write eq. (93.16) as a surface integral over a surface at infinity in four- 
dimensional euclidean space, 

n = - h*j dS v £flU<TT Tr [(Ud^)(Ud a U^(Ud T U^} , (93.17) 

where d ^ = d/dx and e 1234 = +1. (This implies that e px ^ = —1, as can 
be checked by computing the jacobian for this change of coordinates; that 
is why the overall minus sign disappeared.) Now we use eq. (93.13) to write 
the winding number in terms of the vector potential, 

n = ^ I dSp £tlVaT Tr {A u A a A r ) . (93.18) 

Next, we will write this surface integral as a volume integral. 

To do so, we first define the Chern-Simons current , 

J & s = 2 e pvar Tr (A u F aT + ligA u A a A r ) . (93.19) 

This current is not gauge invariant, but the relative coefficient of its two 
terms has been chosen so that its divergence is gauge invariant, 

d^ s = Tr(F^F ar ) 

= 2Tr (F^F^), (93.20) 

where 

F 111 ' = \£^ aT F aT (93.21) 

is the dual field strength. 

On the surface at infinity, the vector potential is a gauge transformation 
of zero, and so the held strength F /w vanishes there. Thus we can use 
eq. (93.19) to write eq. (93.18) as 

n = - £2 / dSfl J ° s ' (93 ‘ 22) 

Using Gauss’s theorem, this becomes 

n = J dAx d n J cs • (93.23) 

Finally, we use eq. (93.20) to get 

n= m^ J d * x Tr(plJ, ' F i^ • (93.24) 
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Thus we have expressed the winding number as a (four-dimensional) volume 
integral of a gauge-invariant expression. 

Now it is easy to construct a Bogomolny bound. We first note that 
F lll, F lw = F^F ^ , and hence 

iTr(iV ± F^f = Tr(F^F^) ± Tr(F^) . (93.25) 

The left-hand side of eq. (93.25) is nonnegative, and so we have 

j d A xTx{F^F^)> J d A xTx(F^F^) . (93.26) 

The left-hand side of eq. (93.26) is twice the euclidean action, while the 
right-hand side is, according to eq. (93.24), \Qix 2 \n\/ g 2 . Thus we have 

S >8n 2 \n\/g 2 . (93.27) 


Eq. (93.27) gives us the minimum value of the euclidean action for a solution 
of the euclidean held equations that mediates between a vacuum configu- 
ration with winding number n_ at X 4 = — oo and a vacuum configuration 
with winding number n+ = n_ + n at X 4 = + 00 . 

From eq. (93.25) we see that we can saturate the bound in eq. (93.27) if 
and only if 

Fimi — (sign n)F flI/ . (93.28) 

We can find an explicit solution of eq. (93.28) for a map with winding 
number n = 1 ; this solution is the instanton. 

We take the map on the boundary to be the identity map, 

rr ,x.A X4 + ix-a 
U{x) = 

P 


( cos x + * sin x cos ip i sin x sin ip e ^ 
i sin x sin ip e + *^ cos x — * sin x cos ip 


(93.29) 


which has n = 1. We then make the ansatz 


A„(x) = p(p)U(x)d^(x) , (93.30) 

where /( 00 ) = 1 (so that the solution obeys the boundary condition) and 
/( 0) = 0 (so that A M is well defined at p = 0). Using eq. (93.14), we find 
that the held strength is 

| [(dufiudvrt + fd^udvtf + f 2 {ud^){ud u rf) 

- (j«<-n/) 1 . (93.31) 
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Using (d p W)U = —U^d p U in the third term, eq. (93.31) can be simplified 
to 

F pv = f [(d p f)Ud„tf + f(l-f)d p Ud„tf - (p^v)\ . (93.32) 

In three-spherical coordinates, we have 

d = pd p + XP~ 1 d x + (ps\nx)~ 1 dp + <j> (psin y sin , (93.33) 


where = d/d<f>, etc., and p is the same radial unit vector as x. Note 
that / is a function of p only, while U is a function of the angles only. We 
therefore have 


Fpx = \f'p~ X Ud x U t , (93.34) 

Fw = |/(l-/)( y o 2 sin 2 xsinV’)" 1 (5 v ,[/^t/ t - d^Ud^) . (93.35) 

Next we note that that eq. (93.21) implies F px = —F^ (since e px ^ = — 1), 
and since F pu = F /w for the instanton solution, we have F px = —F^. 
Thus the right-hand side of eq. (93.34) equals minus the right-hand side of 
eq. (93.35). We then use separation of variables to conclude that 

pf = cf(l-f ) , (93.36) 

Ud x U^ = —c- 1 (sm 2 xsin'i/j)- 1 (d‘ l /,Ud ( f ) U ji — d^Ud^U^) , (93.37) 

where c is the separation constant. If we plug eq. (93.29) into eq. (93.37), 
we find that it is satisfied if c = 2. The solution of eq. (93.36) is then 

f(p) = > ( 93 - 38 ) 

p z + a z 

where a, the size of the instanton , is a constant of integration. The instan- 
ton solution is also parameterized by the location of its center; we have 
used the spacetime origin, but translation invariance allows us to displace 
it. 

If we consider initial and final states whose winding numbers differ by 
more than one, we can construct a mediating solution by patching together 
instantons (or antiinstantons) whose centers are widely separated on the 
scale set by their sizes. Each instanton (or antiinstanton) contributes Si = 
8i T 2 /g 2 to the action, and so the minimum total action is |n + — n_|Si. 

To better understand the role of the 9 parameter, let us consider the 
euclidean path integral, with the boundary condition that we start with a 
state of winding number n_ at xq = — oo, and end with a state with winding 
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number n + at X4 = + 00 . The only field configurations that contribute are 
those with winding number n+ — n_ . We can therefore write 

Zn + ^n.{J) = j VA n+ _ n _ e~ s+JA , (93.39) 

where JA is short for J d A x and the subscript on the field differ- 

ential means that we integrate only over fields with that winding number. 
We see that Z n+ ^ n _(J) depends only on n + — n_, and not separately on 
n+ and n_. This in accord with our previous conclusion that (n'\H\n) 
depends only on n! — n. 

Suppose now that we are interested in starting with a particular theta 
vacuum |$), and ending with a (possibly different) theta vacuum | O'). Then, 
we see from eq. (93.7) that the corresponding path integral is 

Z M (J) = £ e i(»M'-i»-*) Zn+ ^ n _(j). (93.40) 

n_,n+ 

Let n + = n_ + n, so that n + 8' — n-6 = n_ {O' — 6) + n6' . Since Z n+ <^ n _ ( J) 
depends only on n, n_ appears in eq. (93.40) only through a factor of 
e m- ( e _0 - > . Summing over n_ then generates <5(0 / — 0), which implies that 
the value of 9 is time indpendent. (Of course, we already knew this, because 
8 labels energy eigenstates.) We now have 

Z 6 '^e{J) = $(0' - 9) £ e m0 j VA n e~ s+JA . (93.41) 

We can drop the delta function, and just define 

Z e (J) = £ e ine I VA n e~ s+JA . (93.42) 

n •' 

Next, we combine the sum over n and the integral over A n into an integral 
over all A. To account for the factor of e md , we use eq. (93.24); we get 

Zo(J) = JVA exp J d A x Tr + J"A fl . (93.43) 

The vacuum angle 9 now appears as the coefficient of an extra term in the 
Yang-Mills lagrangian. 

We can write the path integral in Minkowski space by setting X4 = it. 
The extra term contains one derivative with respect to X4 , and thus picks 
up a factor of —i. Also, e 1234 = +1 but e 1230 = — 1. Putting all this 
together, we get 

Z e (J) = J VA exp if d 4 x Tr - ^Lf^F^ + (93.44) 
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in Minkowski space. We see that the extra term is gauge invariant, Lorentz 
invariant, hermitian, and has a dimensionless coefficient. We therefore 
could have included it when we first considered Yang- Mills theory. We did 
not do so because this term is a total divergence; see eq. (93.20). We have 
always dropped total divergences from the lagrangian, because they do not 
affect the equations of motion or the Feynman rules. In this case, however, 
the new term does change the quantum physics, as we have seen. We will 
explore this in more detail in the next section. 

So far, we have only discussed SU(2) gauge theory, without scalar or 
fermion fields. What can we say more generally? 

Adding scalar fields has no effect on our analysis. Changing from SU(2) 
to another simple nonabelian group also has no effect; instanton solutions 
always reside in an SU(2) subgroup. If the gauge group is U(l), there are no 
instantons, and hence no vacuum angle. If the gauge group includes more 
than one nonabelian factor, then there is an independent vacuum angle for 
each of these factors. 

On the other hand, adding fermions can significantly change the physics. 
We take this up in the next section. 

Reference Notes 

Instantons are treated in more detail in Coleman and Weinberg II. 

Problems 

93.1) Show that any 2 x 2 special unitary matrix U can be written in the 
form U = a 4 + ia-a, where <24 and the three- vector a are real, and 
a 2 + <24 = 1. 

93.2) Verify that a state of the form of eq. (93.7) is an eigenstate of any 
hamiltonian with matrix elements of the form (n'\H\n) = f(n' — n). 

93.3) The winding number n for a map from S 3 — > S 3 is given by eq. (93.3), 
where U'U = 1. We will prove that n is invariant under an infinites- 
imal deformation of U . Since any smooth deformation can be made 
by compounding infinitesimal ones, this will prove that n is invariant 
under any smooth deformation. 

a) Consider an infinitesimal deformation of U, U — > U + 6U. Show 
that 5W = -W5UU\ and hence that 5(Ud k W) = -Ud k (WdU)Ul 

b) Show that 

5n = [ d 3 x £^ k Tv[(Udi U^)(UdjU^)5(Ud k U^)\ . (93.45) 

247 T Z J 



93: Instantons and Theta Vacua 


581 


Plug in your result from part (a), and integrate dk by parts. Show 
that the resulting integrand vanishes. Hint: make repeated use of 
UdiU' = —diUU', and the antisymmetry of £^ k . 

93.4) Show that if C/ n (x) and £/fc(x) are maps from S 3 — > S 3 with winding 
numbers n and k, then C/„(x)C/fc(x) is a map with winding number 
n + k. Hint: consider smoothly deforming U n {x) to equal one for 
X 3 < 0. How should C/fc(x) be deformed? 

93.5) Use eq. (93.16) to compute the winding number for the map given in 
eq. (93.29), and for a generalization where 4> is replaced by ncj). 

93.6) Use eq. (93.16) to compute the winding number for U n , where U is 

the map given in eq. (93.29). Hint: first show that U can be written 
in the form U = exp where x is a three- vector that you should 

specify. Is your result in accord with the theorem of problem 93.4? 
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94 Quarks and Theta Vacua 

Prerequisite: 77, 83, 93 

Consider quantum chromodynamics with one flavor of massless quark, rep- 
resented by a Dirac field in the fundamental representation of the gauge 
group SU(3). The path integral is 

Z = j VAV'kVy exp i J d 4 x iVljM - jF aiw F« lv - ^L^F^, ; (94.1) 

for the sake of brevity, we have not written the source terms explicitly. 

In addition to the SU(3) gauge symmetry, there is a U(l)y x U(1 )a 
global symmetry of the quark action. However, the U(1)a symmetry is 
anomalous. As we saw in section 77, under a U(1)a transformation 

T -»• e _ia75 T , (94.2) 

T -»• Te' iQ7s , (94.3) 

the integration measure picks up a phase factor, 

DT -»• exp -z / d 4 x TA DT DT . (94.4) 

J 107 T Z ^ 

Using this in eq. (94.1), we see that the effect of a U(1 )a transformation is to 
change the value of the theta angle from 6 to 6 + 2a. Since the value of 6 can 
be changed by making a U(1 )a transformation (which is simply a change of 
the dummy integration variable in the path integral) , we must conclude that 
Z does not depend on 6. Apparently (and surprisingly), adding a massless 
quark has turned the theta angle into a physically irrelevant, unobservable 
parameter. 

How do we reconcile this with our analysis in the previous section, 
where we concluded that instanton-mediated tunneling amplitudes make 
the vacuum energy density depend on 01 The answer is that when we 
perform the integral over the quark field in eq. (94.1), we get a functional 
determinant; the path integral becomes 

Z = J VA det (iJp)e iS e ine , (94.5) 

where S is the Yang-Mills action and n is the winding number. We see from 
eq. (94.5) that Z would be independent of 6 if gauge fields with nonzero 
winding number did not contribute. This will be the case if det (A) van- 
ishes for gauge fields with u / 0. We conclude that ilp must have a zero 
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eigenvalue or zero mode whenever the gauge field has nonzero winding num- 
ber. This renders Z independent of 0. 

Now consider adding a mass term for the quark. If we write the Dirac 
field T in terms of two left-handed Weyl fields x and 


T = 



the mass term reads 




= -mxt, ~ ■ 


(94.6) 


(94.7) 


We have allowed m to be complex: m = \m\e 1 ^ . In terms of T, eq. (94.7) 
can be written as 

Anass = • (94.8) 

A U(1)a transformation changes (j> to (j) + 2a. Since 6 simultaneously 
changes to 9 + 2a, we see that (j) — 9, or equivalently me - , is unchanged. 
Thus, the path integral can (and does) depend on me - , but not on m and 
9 separately. 

With more quark fields, the mass term is C = —M t jXiQ + h.c.; a U(1)a 
transformation changes the phase of every Xi and Q by e* a , and so every 
matrix element of M picks up a factor of e 2ia . Simultaneously, 9 changes 
to 9 + 2Na, where N is the number of quark fields. Thus (det M)e~ te is 
invariant under a U(1)a transformation. 

To understand the effects of the theta angle on hadronic physics, we 
turn to the effective lagranagian (for the case of two light flavors) that we 
developed in section 83, 

C = -hflTrd^dJJ + v 3 Tr (MU + MW) 

+ iN0N - m N N(U^P h + UP n )N 

- l(g A -l)iN^(Ud^P L + rfdnUP^N 

- aN(MP L + M ] P n )N - c 2 iV(t/ t Af t t/ t P L + UMUP n )N 

- c 3 Tr(MU + M t D t )]V(t/ t P L + UP r )N 

- c 4 Tr(MU - M t t/ t )iV(t/ t P L - UP n )N , (94.9) 

where U is a 2 x 2 special unitary matrix field representing the pions, N is 
the field for the nucleon doublet, 


M = 


m,. 


0 


=—* 0/2 


(94.10) 


is the quark mass matrix, u 3 is the value of the quark condensate, and g A 
and Ci are numerical constants. (The terms in the last three lines were 
introduced in problem 83.8.) 
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For the case of 6 = 0, the potential 

V(U) = —v 3 Tr MU + h.c. (94.11) 


is minimized by U = I, and we can expand about this point in powers of the 
pion fields, as we did in section 83. However, for nonzero 9, the minimum 
of V(U) occurs at U = Uq , where Uq is diagonal (because M is) and has 
unit determinant (because U is required to have unit determinant). We 
can therefore write 

( e + l< t> 

U 0 = 

V o 

We determine (j) by minimizing 


(94.12) 


V(U 0 ) 


2v- 


rn u cos( 


i<9) + m d cos((l) + \0) ; 


(94.13) 


the result is 

tan0 = —4 — tan(b0) . (94.14) 

TTl'u T C Iff 

As we will see shortly, experimental results require the value of |#| to be less 
than 10~ 9 ; therefore, we can work to first order in an expansion in powers 
of 9. For 9 <C 1, eqs. (94.12) and (94.14) can be written in the elegant form 

MU 0 = M 0 - iOrhl + 0{9 2 ) , (94.15) 

where 

/m« °\ , 

M 0 = (94.16) 

V 0 m d J 

is the quark mass matrix with 9 set to zero, I is the identity matrix, and 


m u m d 

m = 

m u + m d 


(94.17) 


is the reduced mass of the up and down quarks. 

We can now expand in powers of the pion fields. Though it is not at all 
obvious, it turns out that the most convenient way to define the pion fields 
is by writing 

U{x) = uqu 2 {x)uq , (94.18) 

where Uq = Uq and u(x ) = exp[i7r a (x)T a // vr ]. We also define U(x) = u 2 (x), 
and a new nucleon field M via 


N = ( u 0 uP L + ulu’PjQAf . 


(94.19) 
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Substituting eqs. (94.18) and (94.19) into eq. (94.9), and using uqMuq = 
MU() (which follows because uq and M are both diagonal, and hence com- 
mute), we ultimately get 

C = -\f^Tr d^U + v 3 Tr[(MU 0 )U + (MUo^U^} 

+ iATfiAT — rriNAfAf + AffAf — gjtf^sAf 

— \c + JJ[u(MUq)u + u\M f/o) V] AT 
+ \c^AT[u{MU q )u — il 1 (MUq)^ u^jsAf 

- c 3 Tr[(MU 0 )U + (MU 0 )W^]AfAf 

+ c 4 Tr[(MU 0 )U - (MU 0 )W] Af^AT , (94.20) 

where v M = ^i[u^(d p u) + u(d p v))\, a p = \i[u\d p u) — u{d^v))\, and c± = 
ci i C 2 • Eq. (94.20) is exactly what we found in section 83, except that the 
quark mass matrix M has been replaced everywhere by MUq. 

We can now use eq. (94.15) to get the 0(9) contributions to C. Using 
the fact that Tr (U — U') vanishes in the case of two light flavors, we find 


£ e 


—idrh 


\c+N(U - U^)Af + \c-JJ(U + U*) 75 AA 


+ c 4 Tr(U + W)Af^ 5 AT . 


(94.21) 


Expanding in powers of the pion fields yields 

Cg = —i9m(c-+4c 4 )AJ^N — (Oc+rh/ /■ n )'K ar ATcr a N + . . . , (94.22) 

where we used T a = ^ a a . We can eliminate the first term with a field 
redefinition of the form Af — ► e~ iai5 Af. This generates some new terms 
in eq. (94.20), but all have at least two factors of quark masses, and hence 
can be neglected. The second term in eq. (94.22) provides a pion-nucleon 
coupling that violates both parity P and time-reversal T (equivalently, 
CP). 

The value of c+ can be fixed by baryon mass differences. The c+ 
term in eq. (94.20) makes a contribution of c + (m u — mg) to the proton- 
neutron mass difference, m p — m n = — 1.3 MeV. Using m u = 1.7MeV 
and mg = 3.9 MeV yields c+ = 0.6. However, there is a comparable 
electromagnetic contribution to the proton-neutron mass difference. We 
get a better estimate from the masses of baryons with strange quarks, 
c + (m s — \ m u — \rrid) = o —m^o = 122 MeV; using m s = 76 MeV yields 
c+ = 1.7. (All these values assume the MS renormalization scheme with 
H = 2GeV.) 

For comparison with the interaction of eq. (94.22), the dominant (P and 
CP conserving) pion-nucleon interaction comes from the last term in the 
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► X ► + » x — ► — A — ► 

n p n n p n 


Figure 94.1: Diagrams contributing to the electric dipole moment of the 
neutron that are enhanced by a chiral log. The CP violating vertex is 
denoted with a cross. 

second line of eq. (94.20), and is 

= {gJU)d^ a JfT a ^ lb M , (94.23) 

where g A = 1.27. As we did in section 83, we can integrate by parts to put 
the derivative on the nucleon fields, and then (for on-shell nucleons) use 
the Dirac equation to get 


C -kNN = -i{9Arn N / fn)Tr a A('a a 'y 5 Ar . (94.24) 

The strongest limit on a CP violating pion-nucleon coupling comes from 
measurements of the electric dipole moment of the neutron. The Feynman 
diagrams of fig. (94.1) contribute to an ampitude of the form 

T = -2iD(g 2 )£*(g)u^(p / )5’^XX5«s(p) , (94.25) 

where q = p' — p. In the q — > 0 limit, this corresponds to a term in the 
effective lagrangian of 


C = D(0)F /iV nS lu 'i'y 5 n , (94.26) 

where n is the neutron field; see section 64. If the factor of ^75 was absent, 
this would represent a contribution of D( 0) to the magnetic dipole moment 
of the neutron. To account for the factor of '< 75 , we use 

5^*75 = -\er pa S pa (94.27) 

to see that eq. (94.26) is equivalent to 

C = -D(0)F^nS^ u n , (94.28) 

where F IJV = ^£^ p< 7 F pcr is the dual field strength. Since B = — E, eq. (94.26) 
represents a contribution of D{ 0) to the electric dipole moment of the 
neutron d n . 
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Figure 94.2: Momentum flow in the diagrams of fig. (94.1). 


The salient feature of the diagrams shown in fig. (94.1) is that the pho- 
ton line attaches to the pion line. These diagrams are enhanced by a chiral 
log ln(A 2 /m 2 ) ~ 4.2, where A ~ 4irf n is the ultraviolet cutoff in the effec- 
tive theory. No other contributing diagrams have this enhancement; it is an 
infrared effect, due to the light pion. Of course, 4.2 is not an impressively 
large number, and so we cannot be certain that the remaining contribu- 
tions are not significant. These contributions depend on coefficients in the 
effective lagrangian that are not well determined by other experimental 
results. 

Using 


7 r a 


( 7T° \ph r + \ 

\ \/2tT “ — 7T° ) 


(94.29) 


we can write the charged-pion terms in eqs. (94.24) and (94.21) as 


C Tnn = -i^iaAmN/f^in+p^n + iT n^p) , (94.30) 

C eTNN = -V^iOc+m/ f v )(-K + pn + n~np) . (94.31) 


From these we read off the pion-nucleon vertex factors. We label the in- 
ternal momenta as shown in fig. (94.2); for small q, £ is a pion momentum 
that should be cut off at A ~ 4iTf n . Because the terms of interest have a 
chiral log produced by an infrared divergence at small £, we can treat t as 
much less than p and p' . Thus the internal proton is nearly on-shell, which 
justifies the use of eq. (94.24) for the CP conserving interaction. 

The diagrams of fig. (94.1) yield an amplitude of 

iT = (j) { ie )(^ 9 Arn N / f v )(-iV29c + fh/ 

(2l M ) u'\(-f-p + mjy)75 + 75 (~f~p + m N )]u 
((£+p) 2 + m 2 N ){(£+\q) 2 + ml)((£-\q) 2 + ml) ’ 


x 


(94.32) 
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where p = \(yp' +p)- Using {7^, 75} = 0, the spinor factors in the numerator 
simplify to 

u'[. . .]u = 2rriN u'^u . (94.33) 

From the spinor properties established in section 38, it is easy to check that 
u'^u vanishes when p' = p ; thus u'^u must be O(q), and so we can set 
q = 0 everywhere else. Also, taking £ <C p, we can set {£+p) 2 + m 2 N = 2 p-£ 
in the denominator of eq. (94.32). We now have 


T = 4 {eeg A c + fhm 2 N / fl)£^ 


r A d 4 £ (2-P) u'^u 

0 (27t) 4 (2 p-£)(£ 2 + m 2 ) 2 


Integrating over the direction of £ results in 

pfl pfl 

p-£ p 2 m 2 N 


Next we use the Gordon identity (see problem 38.4) 
p^u'^u = u’S^qQ^u + 0(q 2 ) , 


which verifies that u'^y^u is linear in q. We now have 


T = -4{e6g A c+m/ fl^u' S^qQ^u 


r A dH l 

0 (27 r) 4 (£ 2 + ml) 2 


(94.34) 


(94.35) 


(94.36) 


(94.37) 


In the limit m, r — > 0, the integral diverges at small £, generating a chiral 
log. This infrared divergence can only arise from diagrams with two pion 
propagators, which is why it appears only if the photon is attached to the 
pion. 

After a Wick rotation, the integral evaluates to (?yi67r 2 ) ln(A 2 /m))). 
Comparing with eq. (94.25), we see that the electric dipole moment of the 
neutron is 


dn 


edg A c + m 
8 7T 2 / 2 


ln(A 2 /m 2 ) + 0(1) . 


(94.38) 


Putting in numbers ( g A = 1.27, c+ = 1.7, rh = 1.2 MeV), we find 

d n = 3.2 x 10“ 16 6 ecrn . (94.39) 


The experimental upper limit is \d n \ < 6.3 x 10 -26 ecm, and so we must 
have \6\ < 2 x 10 -10 . 

Such a small value for a fundamental parameter cries out for an ex- 
planation; this is the strong CP problem. Several solutions have been pro- 
posed. (1) The up quark mass may actually be zero, since a massless quark 
renders 9 unobservable (and effectively zero). This requires higher-order 
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corrections in the quark masses to account for the masses of the pseurlo- 
goldstone bosons. (2) The fundamental lagrangian may be CP invariant, 
and the observed CP violation in weak interactions due to spontaneous 
breaking of CP symmetry. (3) The theta parameter may be promoted to 
a held, the axion, which would minimize its energy by rolling to 8 = 0; see 
problem 94.2. All of these solutions have interesting physical consequences. 

Reference Notes 

Quarks and theta vacua are discussed in Coleman, Ramond, II, and Wein- 
berg II. 


Problems 

94.1) Carry out the held redefinition discussed after eq. (94.22), and verify 
that all new terms generated in the lagrangian are suppressed by at 
least two powers of quark masses. 

94.2) Consider adding to the Standard Model a massless quark, represented 

by a pair of Weyl fermions x and £ in the 3 and 3 representations 
of SU(3). Also add a complex scalar 4> in the singlet representa- 
tion. Assume that these helds have a Yukawa interaction of the form 
-^Yuk = + h.c., where y is the Yukawa coupling constant. As- 

sume that the scalar potential V (<k) depends only on 

a) Show that the lagrangian is invariant under a Peccei- Quinn trans- 
formation x ” e *°A> £ — 1 ► e* Q £, <h — e^ 2 * a< h, all other helds un- 
changed. 

b) Show that this global U(1)pq symmetry is anomalous, and that 
9 — * 9 + 2a under a U(1)pq transformation. 

c) Suppose that V(3>) has its minimum at |4>| = f/y/ 2, with f ^ 0. 
Show that this gives a mass to the quark we introduced. 

d) Write <h = 2 _1 / 2 (/ + p)e m ^ , where p and a are helds. Argue that, 
in eq. (94.10), we should replace 9 with 9 + a / '/, and add to eq. (94.9) 
a kinetic term — ^d^ad^a for the a held. 

d) Show that the minimum of V (U), dehned in eq. (94.11), is at U = / 
and a = —f9. Show that P and CP are conserved at this minimum. 

e) The particle corresponding to the a held is the axiom, compute its 
mass, assuming / 3> / 7 r . 

f) Note that if / is large, the extra quark becomes very heavy, and 
the axion becomes very light. Show that couplings of the axion to 
the hadrons are all suppressed by a factor of 1//. 
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95 Supersymmetry 

Prerequisite: 69 

Supersymmetry is a continuous symmetry that mixes up bosonic and fer- 
mionic degrees of freedom. A supersymmetric theory (in four spacetime 
dimensions) has a set of supercharges Q a A, where a is a left-handed spinor 
index, and A is an internal index that runs from 1 to J\f, where the al- 
lowed values of M are 1, 2, and 4. The supercharges can be obtained as 
integrals over d?x of the time component of a supercurrent. The supercur- 
rent is found via the Noether procedure, once we have identified the set of 


supersymmetry transformations that leaves the action invariant. 

The supercharges Q a A and their hermitian conjugates together 

with the generators of the Poincare group and M IW , obey a supersym- 

metry algebra 

[< QaA,P M ] =0, 

(95.1) 

[Ql A ,n = o, 

(95.2) 

[< QaA,M H = (SDa C QcA , 

(95.3) 

[Q{ A ,M^] = (Sna 6 QlA, 

(95.4) 

{ QaA'iQbB } ~ ^AB^ab 5 

(95.5) 

{QoA,QIb} = -Mabo^Pv 

(95.6) 


Eqs. (95.1) and (95.2) simply say that the supercharges are conserved, and 
eqs. (95.3) and (95.4) simply say that their spinor indices are indeed spinor 
indices. In eq. (95.5), Zab = —Zba must commute with Q a Ai and 1W 7 "', 
and so represents a central charge in the supersymmetry algebra. We will 
be concerned only with the case of M = 1 supersymmetry: the index A 
then takes on only one value (and so can be dropped), and Zab = 0 . 

M = 1 supersymmetric theories are most easily formulated in super- 
space , where we augment the usual spacetime coordinate x^ with an anti- 
commuting left-handed spinor coordinate 9 a and its right-handed complex 
conjugate 0*. We define superfields 4>(x, 9, 9*) that are functions of all these 
coordinates. 

The energy-momentum vector generates translations of the usual space- 
time coordinate x^ in the usual way, 

[$(x, 9, 9*), P M ] = — ic^4>(x, 9, 9*) . (95.7) 

By analogy, we would expect 


Mx,9,9*),Q a ] 


iQ a $(x,9,9*) , 


(95.8) 
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[$(*, 0, 9*), qI] = -iQ* d $(*, 9, 9*) , (95.9) 

where Q a and Q* are appropriate differential operators. To figure out 
what they should be, we first introduce the anticommuting derivatives d a = 
d/d9 a and d* = d/d9* a , which obey d a 9 c = 5 a c and dt9* c = 5a c ■ Note, 
however, that complex conjugation should reverse the order of a product 
of Grassmann variables, in order to maintain consistency with hermitian 
conjugation. Then we have 5 a c = ( d a 9 c )* = 9* c (d a )* = — (d a )*9* c , which 
implies 

(a a )* = -d* . (95.10) 

Thus, our first guess for the differential operators in eqs. (95.8) and (95.9) 
is Q a = d a and Q* = — d?. However, this choice is inconsistent with 

{Qa,Ql } = — 

An alternative that avoids this pitfall is 

Qa = +d a + iay*%, (95.11) 

Ql = ~d*a - lO^d, • (95.12) 

These obey the anticommutation relations 

{Qa,Qb} = {Ql,Q*} = o, (95.13) 

{Qa,Ql} = -^Kad, ■ (95.14) 

It is straightforward to check that eqs. (95.5-95.12) are now mutually com- 
patible. In particular, the Jacobi identity 

{[<*>, QIQ ] } + {[<*>, Q ] IQ} - [<*>, {Q, Q^}\ = o (95.15) 

is satisfied. 

Next we introduce the supercovariant derivatives 

V a = +d a -ia^9*%, (95.16) 

V* = -d* + i9 c a^ ■ (95.17) 

These obey 

{V a ,V b } = {V*,V*} = 0 , (95.18) 

{V a ,Vt} = 2 io^dv, (95.19) 

{V a , Q b } = {V a , Q*} = { V *, Q b } = { V l, Q*} = 0 . (95.20) 

Because of eq. (95.20), we could impose the condition V a § = 0 or = 0 
on a superfield, and this condition would be preserved by the supersymme- 
try transformations of eqs. (95.8) and (95.9). A superfield that obeys 


D*$(x,M*) = 0 


(95.21) 
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is a left-handed chiral superfield. Its hermitian conjugate &(x, 9, 9*) obeys 

V a &(x,9,9*) = 0, (95.22) 

and is a right-handed chiral superfield. 

We can solve eq. (95.21) by introducing 

if = x 11 - i9 c a'fi9* d , (95.23) 

and noting that 

V*8 o = 0 and V\t = 0. (95.24) 

(When verifying T>^y^ 1 = 0, remember that there is a minus sign from 
pulling the d ? through 6 C .) Thus, any superfield $>(y,6) that is a function 
of y and 6 only is a left-handed chiral superfield. 

We can expand $(y,Q) in powers of 0; because 6 is an anticommuting 
variable with a two- valued index, we have 9 a 9 b 9 c = 0, and so the expansion 
terminates after the quadratic term. We thus have 

Hy, 9) = A(y ) + v/2 9il>(y) + 88F(y) , (95.25) 

where A(y) and F(y) are complex scalar fields, fi a (y) is a left-handed Weyl 
field, and we have used our standard index-suppression conventions: 9fi = 
9 a fi> a and 99 = 9 a 9 a - The factor of root- two is conventional. 

We can now substitute in eq. (95.23), and continue to expand in powers 
of 9 and 9* . Making use of the spinor identities 

9 a 9 b = F\99e ab , 9 a 9 b = -\99e ab , (95.26) 

6*9* = -i 9*9 * , 9* d 9* b = +\9*9*e hb , (95.27) 

where 99 = 9 a 9 a and 9*9* = (99)* = 9* il 9 * a , along with the Fierz identity 

(9a^9*)(9a u 9*) = -\999*9*g^ , (95.28) 

we find 

$(x, 9 , 9* ) = A(x) + V29fi(x) + 99F(x) - i(9a fM 9*)d fl A(x ) 

- -t=i999*d^d^(x) + \999*9* d 2 A(x) . (95.29) 

Let us investigate the properties of a left-handed chiral superfield under 
a supersymmetry transformation, given by eqs. (95.8) and (95.9). It is 
easiest to use the y and 9 coordinates, since 

Qa9 b = 5 a b , Qay» = 0 , 

Qlo b = o , qw = . 


(95.30) 
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We thus have 


Qa${y,9) = d a ${y,9) 


= V2MV) + 29 a F(y) , 

(95.31) 

Ql<$>(y,6) = -2 i9 c a>\d^{y,9) 


= -2i8 c a^d„A(y) + iV299d^ c (y)a ^ , 

(95.32) 


where is with respect to y: we used eq. (95.26) to get the last line. We 
can now find the supersymmetry transformations of the component fields 
A. i/j, and F by matching powers of 9 on each side of eqs. (95.8) and (95.9). 
Remembering that the Q and Q t operators anticommute with 8 and 9*, we 
get 

[A, Q a ] = -W 20 a , [A, Q\] = 0 , (95.33) 

{ipcQa} = - iV2£ ac F , {'0c, Q\} = - VZa^d^A , (95.34) 

[F, Qa} = o , [F, Qt] = V2d^ c a^ , (95.35) 

where all component fields have spacetime argument y. However, y is 
arbitrary, and so we are free to replace it with x. 

Eq. (95.35) is the most important: it tells us that the supersymmetry 
transformation of the F field is a total derivative. Therefore, / d 4: xF(x ) 
is invariant under a supersymmetry transformation, and hence could be a 
term in the action of a supersymmetric theory. 

The product of two left-handed chiral superfields is another left-handed 
chiral superfield; this is obvious from eq. (95.25), and the fact that the 
9 expansion always terminates with the quadratic term. For two chiral 
superfields <£>i (y,9) and <i> 2 (y,9), we have 

$ 1 cfi 2 = AiA 2 + V29(Aiip2 + Mi>i) + 99(AiF 2 + A 2 Fi - 0 1 ^ 2 ) • (95.36) 


More generally, given a set of left-handed chiral superfields <hj, we can 
consider a function of them W (4>); this function is itself a left-handed chiral 
superfield. Its F term (the coefficient of 99) is 


W($) 


dW (A) _ 1 d 2 W{A) 

dAi 1 2 dAidAj 


V’iVh , 


(95.37) 


where repeated indices are summed. The spacetime integral of this term 
(like the spacetime integral of any F term) is invariant under supersymme- 
try, and hence could be a term in the action of a supersymmetric theory. 
In this case, the function W(3>) is called the superpotential. 
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We still need kinetic terms. To get them, we first investigate the prop- 
erties of a vector superfield. A vector superfield V(x, 6,6*) is hermitian, 

[v(x, e, e * )] f = v(x, e , e* ) , (95.38) 

but is not subject to any other constraint. Its component expansion is 

V(x, e, 9*) = C(x) + 6x(x) + 9*x f (s) + 99M(x) + 9*9* (x) 

+ 9^ i 9*v ll {x) + 999* A f (x) + 9*9*9\{x) 

+ \999*9*D(x) , (95.39) 

where C and D are real scalar fields, M is a complex scalar field, x and A 
are left-handed Weyl fields, and v ^ is a real vector field. 

Following the analysis that led to eq. (95.35), we find 

[D, Qa } = -KA^ C , [D, Qh = +d l X< t • (95.40) 

We see that the supersymmetry transformation of the D component of a 
vector superfield is a total derivative. Therefore, / ci 4 x D{x) is invariant 
under a supersymmetry transformation, and hence could be a term in the 
action of a supersymmetric theory. 

Consider the product of a left-handed chiral superfield $>(x,9,9*), as 
given by eq. (95.29), and its hermitian conjugate 

( x , 9 , 9*) = A(x) + V2 9* A (x) + 9*9* (x) + i(9a»9*)d lx A ! (®) 

+ ii0*9*d^ {x)X9 + \999*9*d 2 A\x ) . (95.41) 

The product is obviously hermitian, and so is a vector superfield. 
After considerable use of eqs. (95.26-95.28), we find that the D term (the 
coefficient of 999*9*) of this vector superfield is 

$t$| = - A'XdfjA + \AdX + \A ] d 2 A 

+ \AXdA - 

+ FX . (95.42) 

The spacetime integral of this term (like the spacetime integral of any D 
term) is invariant under supersymmetry, and hence could be a term in the 
action of a supersymmetric theory. After some integrations by parts, and 
dropping total divergences, we find 

$t$| = -dXd^A + + FX . (95.43) 

We see that we have standard kinetic terms for the complex scalar field A 
and the left-handed Weyl field ip. We also have a term with no derivatives 
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for the complex scalar field F. The F field is therefore called an auxiliary 
field. 

If we consider a set of left-handed chiral superfields <R, we get a hermi- 
tian, supersymmetric action if we take as the lagrangian 


C = + (w($)|^ + h.c.) 


(95.44) 


where the index in the first term is summed. Since iq appears only quadrat- 
ically and without derivatives, we can easily perform the path integral over 
it. The result is equivalent to solving the classical equation of motion for 


F. 


dc 

dFi 


r t | W(A) 
i dAi 


= 0 , 


(95.45) 


and substituting the solution back into the lagrangian; the result is 


£ 


-d^Ajd^Ai + iipj 


dW(A ) 

2 1 

dAi 

2 


d 2 W (A) 
dAidAj 


fi’i'fij + h.c. , 


(95.46) 


where the indices are summed in each term. 

As an example, let us consider a single left-handed chiral superfield, 
with superpotential 

W(A) = \m,A 2 + \gA z . (95.47) 

This is the Wess-Zumino model. The scalar potential is 


V(A) = \8W/dA \ 2 

= m 2 A^A + r;gm(A^A 2 + A^ 2 A) + ig 2 (A^A) 2 . (95.48) 


We see that the scalar has mass m. The last term in eq. (95.46) becomes 
£mass+Yuk = - ^gAfifi + h.c. . (95.49) 


We see that the fermion also has mass m, and a Yukawa interaction with 
the scalar. The Yukawa couping is related (by supersymmetry) to the cubic 
and quartic self-interactions of the scalar. 

Next we would like to introduce gauge fields. Recall that the vector 
superfield V(x,6,9*) has among its components a real vector field v fl (x) 
that could be identified as an abelian gauge field. (Later we will add an 
adjoint index to the superfield in order to get a nonabelian gauge field.) 

We need to generalize the notion of a gauge transformation to super- 
fields. We begin by noting that if H is a left-handed chiral superfield, then 
i(Et — 5) is a vector superfield. We then define a supergauge transformation 
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We will attempt to construct actions that are invariant under eq. (95.50). 
Following the pattern of eqs. (95.29) and (95.41), we write 

H(®, 6, 6*) = B{x ) + 9£(x) + 99G{x) 

-itOa^O* )d lt B(x) + . . . , (95.51) 

H* (x, e, 9*) = B f (x) + 9*^ ( X ) + 6*6* Gfl (. x ) 

+i(9a g 9*)d^B\x) + ... . (95.52) 

If we set B = |(6 + ia), where a and b are real scalar Helds, we find 

i(H f - H) = a - i9£ + i6*g - i99G + i9*9*G^ - (9a g 9*)d fl b + . . . . (95.53) 

From eq. (95.39), we see that the supergauge transformation of eq. (95.50) 
results in 

C -»• C + a , 

x x - *£ » 

M -> M - iG , 

* Up — d^b . (95.54) 

The last of these is the usual abelian gauge transformation. The first three 
allow us to gauge away the C, x, and M components of a vector superfield. 
That is, we can make a supergauge transformation with a = —C, £ = — iy 
and G = —%M\ in this gauge, known as Wess-Zumino gauge , the vector 
superfield becomes 

V = (9a /x 9*)v fi + 999* A f + 9*9*9X + \999*9*D . (95.55) 

Note that we still have the freedom to make the supergauge transformation 
of eq. (95.50) with B(x) = ^b(x), and that this still implements the ordinary 
abelian gauge transformation of eq. (95.54). 

Now consider a left-handed chiral superfield <b that has charge +1 under 
a U(l) gauge group. We take the kinetic term for <F to be 

A «n = &e~ 29V <f>\ D , (95.56) 

where g is the gauge coupling. The vector superfield ^'e~ 2gV <h is clearly 
invariant under the supergauge transformation 


$ - 

in 

1 

QJ 

t 

(95.57) 

'fit _ 

■* $t e +2*<?St ^ 

(95.58) 

V - 

-> V + i(S f -3) . 

(95.59) 
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Let us evaluate eq. (95.56) in Wess-Zumino gauge, where we have 

v 2 = -\eee*e*v^v ^ , ( 95 . 60 ) 

F 3 = 0 . (95.61) 

The exponential factor in eq. (95.56) becomes 

e - 2 9 v = 1 _ 2 g{9a lx 9*)v ll - 2g999*X ] - 2g9*9*9X 

- 999*9* {gD + 5 2 u%) . (95.62) 

The relevant terms in 4>t<3? are 

= A f A + y/29*^A + V29i/A^ 

+ (9a^9*)(^a^ - iA^d^A + iAd^A/) 

+ ... + 999*9* {&<$>) D , (95.63) 

where we used 2 (9*^)(9/j) = (9a fJ '9*)(^a fl ip) to get the first term in the 
second line, and (<hl<h)£> is given by eq. (95.42). 

Combining eqs. (95.62) and (95.63), taking the D term, and performing 
the same integrations by parts that led to eq. (95.43), we find 

$t e - 2 s y $ = -(D^A) ] D^A + i^a^D^ + F ] F 

+ V2g^X^A + V2gA^X'il; - gA^DA , (95.64) 

where = <9 M — igv y is the usual gauge covariant derivative acting on held 
of charge +1. 

We still need a kinetic term for the vector superfield. To get it, we first 
introduce a superfield that carries a left-handed spinor index, 

W a = ±V*V*“V a V . (95.65) 

Since the two components of V* anticommute, we have = 0. 

Thus W a obeys T>lW a = 0, and is therefore a left-handed chiral super- 
held. Furthermore, W a is invariant under the supergauge transformation 
of eq. (95.50). To see this, we hrst note that T>lV* a V a annihilates (be- 
cause V a does). Then, we use eq. (95.19) to write 

V*V* d V a = _( V * Va + 2ia/ d d^)V* h . (95.66) 

Thus, V* h V* a V a also annihilates H (because V* a does). Therefore, lT a is 
invariant under eq. (95.50). 

Since W a is a left-handed chiral superheld, it has an expansion in the 
form of eq. (95.25). To hnd the component helds of W a , we set x = y + 
i9(jL L 9* in eq. (95.55), and expand in 9 and 9* . The result is 

V = (9a^9*) Vti + 999* A f + 9*9* 9X + \999*9*{D - id%) , (95.67) 
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where all component fields have spacetime argument y. From eq. (95.24), 
we see that T>* = —9? when it acts on a function of y , 9, and 9*. We also 
have 

V a = ( V a 9 c )d c + (V a 9* d )8* + (: V a y **)fy = da + 0- 2ia^9*% , (95.68) 
where d p is with respect to y. Using eq. (95.68), we hnd 

VaY = 0*0* A a + 9 a (D - id-v) - i(cr^a u 9) a d p v u + i00(u^9 At A t ) a 

+ ... . (95.69) 

When we act on D a V with D?D* a = <9t<9* a to get W a , only the coefficient 
of 0*0* survives. Since <9?9* a (0*0*) = 4, we find 

Wa = A a + 0 a (-t> - id-v) - i(a^a u 9) a d p v u + i99{a^d p X^) a . (95.70) 

We can simplify eq. (95.70) by using the identity 

KOa 6 = -<r&a C - 2 i(SDa b • (95.71) 

Remembering that is antisymmetric on p z/, and defining the held 
strength 

Fpv = <9^ - d u Vfj, , (95.72) 

we get 

= A a + 9 a D - (SDa C 9 c F^ + ma^d^ . (95.73) 

We see that lU a involves the vector held only through its gauge-invariant 
held strength F /llt . Since W a is supergauge invariant, this is to be expected. 

Next, consider the F term of W a W a . This term is Lorentz invariant, 
and its spacetime integral (like the spacetime integral of any F term) is 
invariant under supersymmetry, and hence could be a term in the action of 
a supersymmetric theory. Working out the components, we hnd 

W a W a = 2iA a aF d p A fd - \Tr{SrsnF„„F pc + D 2 , (95.74) 

where Tr(S^ v S^ 7 ) = {S^ u ) a c (S^ a ) c a . To get the spin matrices into this 
form, we used the fact that (S^ u ) ac is symmetric on a c. Now we use 
the identity 

Tr{SrSF) = \{g^9 vrT - <fV P ) - \ie ta,pa (95.75) 

to get 

W a W a \ F = 2iA a a^d^“ - \F^F pu - \iF^F pv + D 2 , 


(95.76) 
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where F^ LU = TjS llVf>rJ Fp rJ . We can now identify the kinetic term for the 
vector superfield as 


£kin = i W a W a 


h.c. 


= iX^a^dpX - \Fi JV F IJV + \D 2 . 


(95.77) 


We integrated by parts and dropped total divergences to get the second 
line. We see that we have the standard kinetic terms for the gauge field 
and the gaugino field A, while D is an auxiliary field. 

All of this generalizes in a straightforward way to the nonabelian case. 
We define a matrix- valued vector superfield V = V a T£, and a matrix- 
valued chiral superfield S = S“T^, where a is an adjoint group index. A 
chiral superfield $ in the representation R still transforms according to 
eqs. (95.57) and (95.58), but for the vector held we have 

e -2W ^ e -2ig~i e -2gV e +2ig~ . (95.78) 


this reduces to eq. (95.59) in the abelian case. 

The field-strength superfield is now 

W a = -± j V*V* d e +29V V a e- 29V . 

Under a supergauge transformation, 

XV a -► e~ 2i9S W a e +2i9S . 


(95.79) 


(95.80) 


Eq. (95.73) still holds, but the derivative that acts on is now the gauge co- 
variant derivative for the adjoint representation, and F 9,u now includes the 
usual nonabelian commutator term. These changes also apply to eq. (95.77), 
where we must also trace over the group indices and (for proper normal- 
ization) divide by the index T(R). 


Reference Notes 


Introductions to supersymmetry can be found in Martin , Siegel, Weinberg 
III, and Wess & Bagger. 


Problems 

95.1) Use eq. (95.6) to show that the hamiltonian is positive semidehnite, 
and that a state with zero energy must be annihilated by all the 
supercharges. 
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95.2) Supersymmetry is spontaneously broken if the ground state |0) is not 
annihilated by all the supercharges. 

a) Use the first of eqs. (95.34) to show that supersymmetry is spon- 
taneously broken if (0|F|0) / 0. 

b) Compute {A a ,Qb}- Use the result to show that supersymmetry is 
spontaneously broken if (0|D|0) / 0. 

95.3) Consider a supersymmetric theory with three chiral superfields A, B, 
and C, and a superpotential W = mBC + kA(C 2 — v 2 ), where m and 
v are parameters with dimensions of mass, and n is a dimensionless 
coupling constant. This is the O Raifeartaigh model. 

a) Show that one or more F components is nonzero at the minimum 
of the potential, and hence that supersymmetry is spontaneously bro- 
ken. 

b) Show that the potential is minimized along a line in field space, 
and find the masses of the particles at an arbitrary point on this 
line. You should find that there is a massless Goldstone fermion or 
goldstino that is related by supersymmetry to the linear combination 
of F fields that gets a nonzero vacuum expectation value. 

95.4) Supersymmetric Quantum Electrodynamics. Consider a supersym- 
metric U(l) gauge theory with chiral superfields $ and with charges 
+1 and — 1, respectively. Here the bar over the $ in the field is part 
of the name of the field , and does not denote any sort of conjugation. 
We include a gauge invariant super potential W = 

a) Work out the lagrangian in terms of the component fields. 

b) Eliminate the auxilliary fields F, F, and D. 

95.5) In a supersymmetric gauge theory with a U(l) factor, we can add a 
Fayet-Illiopoulos term C ¥l = efD to the lagrangian, where D is the 
auxilliary field for the U(l) gauge field, and £ is a parameter with 
dimensions of mass-squared. 

a) Explain why adding this term preserves supersymmetry. Explain 
why the corresponding gauge field cannot be nonabelian. 

b) Add this term to the SQED lagrangian that you found in problem 
94.4, and eliminate the auxilliary fields. 

c) Minimize the resulting potential. Show that supersymmetry is 
spontaneously broken if £ is in a certain range. 

95.6) R symmetry. Given a supersymmetric gauge theory (abelian or non- 
abelian), consider a global U(l) transformation that changes the 
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phase of the gaugino fields, A a — > e *“A a ; we say that the gauginos 
have R charge +1. 

a) If R symmetry is to be a good symmetry of the lagrangian, what 
relation must hold between the R charges of the scalar and fermion 
components of a chiral superfield that couples to the gauge fields? 

b) In additon, what conditions must be placed on the superpotential? 

c) Identify the R symmetry, if any, of supersymmetric quantum elec- 
trodynamics. 
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96 The Minimal Supersymmetric Standard 

Model 

Prerequisite: 89, 95 

Having seen how to construct a general supersymmetric gauge theory in 
section 95, we can now write down a supersymmetric version of the Stan- 
dard Model. 

To do so, we introduce vector superfields for the gauge group SU(3) x 
SU(2) x U(l), which include the usual gauge bosons along with their fer- 
mionic partners, the gauginos ; chiral superfields Ljj , E r , Q ai/ , Uf, and Df 
that include three generations of quarks and leptons along with their scalar 
partners, the squarks and sleptons ; and chiral superfields Hi and Hi in the 
representations (1,2,— and (1,2,+^), which include two copies of the 
usual Higgs field along with their fermionic partners, the higgsinos. We 
give all these fields supersymmetric, gauge invariant kinetic terms, and a 
superpotential 

W = — yiJ Hi Lj j Ej - y' IJ H l Q a]I D° J - y'fiWQ^U* - p.: : ' 11,11; , (96.1) 

where p is a mass parameter. This superpotential generates the usual 
Yukawa couplings, among others, and gives a positive mass-squared to both 
Higgs fields. We forbid terms of the form H l Ln (which are allowed by 
the gauge symmetry) by invoking a discrete symmetry, R parity. Under R 
parity, all Standard Model fields (including both Higgs scalars) are taken to 
be even, and all superpartner fields (gauginos, higgsinos, squarks, sleptons) 
are taken to be odd. 

Supersymmetry is clearly not an exact symmetry of the real world, 
and so if present must be spontaneously broken. Correct phenomenology 
requires supersymmetry breaking to be triggered by fields other than those 
listed above. There are many possibilities for the dynamics of the fields 
in this hidden sector , and an exploration of them is beyond our scope. 
However, we can parameterize the effects of the hidden sector via a spurion 
field. This is a constant chiral superfield of the form 

S = m 2 s O0, (96.2) 

where ms is the supersymmetry breaking scale. We couple S to the quark, 
lepton, and Higgs superfields via D terms of the form 

Tspur , d = m~M S^S \c^H^H + C^H^H 

+ [c^l\Lj + C^ElEj + C^QlQj 

+ c\ U J ) U}U J + C { i° ) D\D J ]X . (96.3) 

J ) D 
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Here we have suppressed all but generation indices; each c \ j ^ is a dimen- 
sionless hermitian matrix in generation space. The parameter rriM is the 
messenger scale. Eq. (96.3) gives masses of order m 2 s /mM to the scalars. 

We also couple S to the chiral gauge superfields Wa via F terms of 
the form 


£spur lg auge = m^S £ 
i= 1,2,3 


+ h.C. , 
F 


(96.4) 


where the sum is over the three gauge-group factors; a is a spinor index. 
Eq. (96.4) gives masses of order m 2 s /rriM to the gauginos. 

Finally, we couple S to the chiral superfields via F terms of the form 


28 


c 


spur 


F = m 


,-i 

M 


s y C A W A 


A= 1 


+ h.c. 


(96.5) 


where W a is one of the 28 gauge-invariant terms in the superpotential. In 
particular, eq. (96.5) includes a mass term of the form HH + h.c. for the 
Higgs scalars. 

Eqs. (96.1) and (96.3-96.5) specify the Minimal Supersymmetric Stan- 
dard Model , or MSSM for short. Obviously, it has a complicated phe- 
nomenology that is beyond our scope to explore in detail. One point worth 
noting is that R parity implies that the lightest superpartner, or LSP, is 
absolutely stable. 


Reference Notes 

Supersymmetric versions of the Standard Model are discussed in Martin , 
Ramond II, and Weinberg III. 


Problems 

96.1) Explain why we need two Higgs doublets. 

96.2) a) Write the mass terms for the Higgs scalars as 

-Cffiggsmass = - m\H ] H - m 2 2 H ] H - + h.c.) , (96.6) 

and compute the quartic terms in H and H that arise from eliminating 
the auxilliary SU(2) and U(l) D fields. 

b) Find the conditions on the mass parameters in eq. (96.6) in order 
for the potential to be bounded below. 

c) Find the conditions on the mass parameters in eq. (96.6) in order 
to have spontaneous breaking of the SU(2) x U(l) symmetry. 
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d) Show that there are five physical Higgs particles, two charged and 
three neutral. 

e) Let tan f) = v/vhe the ratio of the two Higgs VEVs. Show that 

tan/? + cot/3 = (m^ + m\)/m\ . (96.7) 
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97 Grand Unification 

Prerequisite: 89 

The Standard Model is based on the gauge group SU(3) x SU(2) x U(l), 
with left-handed Weyl fields in three copies of the representation (1, 2, — |)© 
(1, 1, +1) © (3, 2, +g) © (3, 1, — |) © (3, 1, +g), and a complex scalar held 
in the representation (1,2,—^). The lagrangian includes all terms of mass 
dimension four or less that are allowed by the gauge symmetries and Lorentz 
invariance. 

To complete the specification of the Standard Model, we need twenty 
real numbers: the three gauge couplings; the three diagonal entries of each 
of the three diagonalized Yukawa coupling matrices (for the up quarks, the 
down quarks, and the charged leptons); the four angles in the CKM mixing 
matrix for the quarks; the vacuum angles for the SU(3) and SU(2) gauge 
groups; the scalar quartic coupling; and the scalar mass-squared. 

The goal of grand unification is to construct a more compact model 
with fewer parameters by supposing that the Standard Model is the result 
of the spontaneous breaking of a larger gauge symmetry. The simplest 
model along these lines is the Georgi- Glashow SU(5) model. Its start- 
ing point is the grand unified gauge group SU(5). We include a scalar 
held = 4> a T a in the adjoint or 24 representation, and assume that 
the scalar potential for this held results in a vacuum expectation value 
(VEV) of the form (0|<f>|0) = diag(— +^, +^)V. As we saw in 
section 84, this VEV spontaneously breaks the gauge symmetry down to 
SU(3) x SU(2) x U(l). The generator of the unbroken U(l) subgroup is 
T 24 = cdiag(— — |, +|, +|), where c 2 = 3/5. It will prove convenient 
to write 

T 24 = ^/|y, (97.1) 

and express the U(l) charge as the value of Y rather than the value of T 24 . 
We also note that the SU(5) breaking scale V must be considerably larger 
than the SU(2) x U(l) breaking scale v ~ 250 GeV in order to suppress the 
observable effects of the extra gauge helds. 

Under the SU(3) x SU(2) x U(l) subgroup, the fundamental and anti- 
fundamental representations of SU(5) transform as 

5 - (3,1,— i)ffi(l,2,+i) . (97.2) 

5-(3,l,+!)ffi(l,2,-i). (97.3) 

Next we use eq. (97.2) to hnd that the product 5 © 5 transforms as 

5 <g> 5 — > (6, 1, -§) s © (3, 2, +±) s © (3, 1, -§) S 

©(3,1,— |)a©(3, 2,+|)a©(1,3,+1)a , 


(97.4) 
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where the subscripts indicate the symmetric and antisymmetric parts of 
the product. In terms of SU(5), we have 

5® 5 = 15 s ©10 A • (97.5) 

Comparing eqs. (97.4) and (97.5), we find 

10— ► (3, 1,-|) ©(3,2, +±)© (1,3, +1) . (97.6) 

From eqs. (97.2) and (97.6), we see that one generation of quark and lepton 
fields fits exactly into the representation 5 © 10 of SU(5). 

We therefore define a left-handed Weyl field if 1 in the 5 representation 
of SU(5), and a left-handed Weyl field Xij = ~Xji i n the 10 representation. 
The gauge covariant derivatives of these fields are 

(PM = -igsAKTD'jV 

= ^ + ^“(n/V’ J ', (97.7) 

(DuXh = dfiXij — ^95 (T±q ) ij Xkl 

= d^Xij - mA^T^fixkj + ( T a )j l xu ] , (97.8) 

where g§ is the SU(5) gauge coupling and T a is the generator matrix in the 
fundamental representation. The kinetic terms for these fields are 

£kin = i'fij^fD^y + , (97.9) 

where the implicit sum over i and j is unrestricted; this necessitates the 
prefactor of one-half in the second term to avoid double counting. The 
interaction terms with the gauge fields then work out to be 

Ant = ~95 +X IJl ( A n)i k ^ IJ 'Xkj , (97.10) 

where A M = A a ^T a is the matrix-valued gauge field, and A ^ is its transpose. 
Note that we have written the factors in matrix-multiplication order (with 
a trace for the second term). 

We can identify the components of fi 1 and Xij as 


( d r 

d b 

d 9 

e 

) 

, (97.11) 

( ° 

u 9 

-u b 

u r 

d r \ 


—u 9 

0 

u r 

u b 

db 


u b 

—uf 

0 
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dg 

, (97.12) 
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where r, b, and g stand for the three colors (red, blue, and green). We can 
also write the gauge fields as 


V~\cB 

r* b 

Li y 

G r 9 

1 Y 1 

v / 2 Af ' 

1 v2 

72 Xr 

G b r 

G b b -\cB 

G b 9 

72 X b 

1 Y2 
72 X b 

G g r 

n b 
^ 9 

G g 9 -\cB 

71 A S 

1 v2 
V2 X 9 

1 Y^ r 


1 Y^ 9 

75 a i 

\W*+\cB 

7i' V+ 

1 Y^ r 

V2 X Z 

1 Y^ b 
72 A 2 

1 Y^ 9 
v / 2 A 2 

72 W ~ 

-|ir 3 +i 


J^CLrpCL 


V 

(97.13) 

where the Lorentz index has been omitted. Here B is the hypercharge 
gauge Held, and W 3 and y/2 W ± = W 1 ± iW 2 are the SU(2) gauge fields. 
The gluon fields Gj are subject to the constraint G r r + G b b + G g 9 = 0. The 
X l a fields correspond to the broken generators of SU(5), and hence become 
massive; here i is an SU(2) index and a is an SU(3) index. As we saw in 
section 84, the X l a fields transform as (3, 2, — |) under SU(3) x SU(2) x U(l), 
and their mass is M x = j~t^ 9 oV. 

If we substitute eqs. (97.11-97.13) into eq. (97.10), we find the usual 
interactions of the SU(3) x SU(2) x U(l) gauge fields with the quarks and 
leptons, but with 


9i = 92 = 93 = 9 5 


(97.14) 


These relations among the gauge couplings hold in the MS renormalization 
scheme; later we will discuss a modified scheme that is more appropriate 
at energies well below M x . 

We also find the couplings of the X field to quarks and leptons; these 
work out to be 




(ciJ)cr M e — eV 9 d a + u^<T li u l £ a p 1 ) 


+ X\ “(-c/tcHV + e* cr 9 u a + d t/3 (7 M u 7 £ a/ 3 7 ) 


+ h.c. 


= — (e u d^cr^Pj - £ lJ e ] cr 9 q ja + q ]0l a^v? e^) + h.c. 

= + h.c. , (97.15) 


where the last line defines the current J 9, that couples to the X g field. (To 
include more than one generation, we add a generation index I to each 
quark and lepton field, and sum over it.) The most interesting feature of 
eq. (97.15) is that the first two terms in the current have baryon number 
B = +)j and lepton number L = + 1, while the third term has B = — | 
and L = 0. Thus exchange of an X boson can violate baryon and lepton 
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number conservation, leading to phenomena such as proton decay. Proton 
decay has not been observed; the limit on the rate 1/r for p — > e + 7r° is 
r > 10 33 yr. A rough estimate of 1/r from eq. (97.15) is g^mp/8nM^. 
Taking 55 ~ g 2 ~ 0.6, we find that we must have M x > 3 x 10 15 GeV. 

We still need a scalar field in the representation (1, 2, — ^). The smallest 
complete representation of SU(5) that includes this piece is the 5. Call the 
corresponding field H'\ we can identify its components as 

IP = (<f> r (j) b cf) 9 ip~ -ip°). (97.16) 

The possible gauge-invariant Yukawa couplings with the r ip t and Xij fields 
are 

^Yuk = - yH^Xij ~ \y"e ijklrn H}xj k Xi m + h-c. ; (97.17) 

with three generations, y and y" become matrices in generation space. We 
can write out £yuk using eqs. (97.11), (97.12), and (97.16); the result is 

C-Yuk = ~ y£ l] <Pilje - ye lJ <piq aj d a - y"ip ]l q ai u a 

~ - y£ lJ cj) a q a ilj - y''(j)l t u a e + h.c. . (97.18) 

The terms on the first line are those of the Standard Model, except that 
the down-quark Yukawa coupling matrix (called y' in section 89) is the 
same as the charged-lepton Yukawa coupling matrix (called y in section 
88). Since the quark and lepton masses are directly proportional to the 
Yukawa couplings, eq. (97.18) predicts 

mi = m T , m s = m M = m e . (97.19) 

These relations hold in the MS renormalization scheme; later we will discuss 
a modified scheme that is more appropriate at energies well below M x . The 
terms on the second line of eq. (97.18) are the couplings of the colored scalar 
field cj) to the quarks and leptons; we see that these couplings, like those of 
the Xp field, violate baryon and lepton number conservation. Since first- 
generation Yukawa couplings are smaller than gauge couplings by a factor 
of 10 5 , the limit on from proton decay is roughly > 10 10 GeV. 

To compute M^, we need the complete scalar potential. For simplic- 
ity, we assume a Z2 symmetry under Then the most general 

renormalizable potential is 

V($, H) = - \m% Tr $ 2 + ±AiTr $ 4 + iA 2 (Tr <3> 2 ) 2 

+ m 2 H H ] H + - \k 2 H^ 2 H . (97.20) 

We take all the parameters (m|, m 2 H , Ai, A 2 , «i, k 2 ) to be positive. We 
found in problem 84.1 that in this case the first line of eq. (97.20) is mini- 
mized by = diag(— +|, +^)V, with V 2 = 36m|/(7Ai + 30A 2 ). 
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From the first and third terms on the second line of eq. (97.20), we find that 

the masses-squared of the ip ( 1 , 2 , — 77 ) and f r^i (3, 1, +g) scalar fields are 

ml = rrijf - \n 2 V 2 , (97.21) 

K = m 2 H - ±k 2 V 2 . (97.22) 

We want m 2 ~ — (lOOGeV ) 2 and M 2 > +(10 10 GeV) 2 . This requires 
ml to be equal to \k 2 V 2 to at least sixteen significant digits (but not 
exactly). There is no obvious reason for the parameters in eq. (97.20) to 
satisfy this odd relation; we would more naturally expect m 2 and Af 2 to 
have the same order of magnitude (and furthermore to have n 2 ~ g 2 and 
hence M& ~ Af x > 10 15 GeV). This is the fine timing problem of grand 
unified theories. More generally, we can ask why the breaking scale of 
the grand unified group is so much larger than the breaking scale of the 
electroweak subgroup; this is gauge hierarchy problem. 

Since the X f and </> fields are so heavy, we can integrate them out, 
generating effective interactions among the quarks and leptons . 1 For the 
X f field, we get the leading term (in a double expansion in powers of g§ and 
inverse powers of Af x ) by ignoring the kinetic energy and other interactions 
of the X/j field, solving the equations of motion for X /t that follow from 
Tv, mass + Tv,int, where Tv,int is given by eq. (97.15) and 


Tv, mass = -MlX^X* , (97.23) 

and finally substituting the solutions back into Tv, mass + Tv,int- This is 
equivalent to evaluating tree-level Feynman diagrams with a single X ex- 
changed. The result is 

Cx,a = 2^5 ■ (97.24) 

Keeping only fields from the first generation, we find that the baryon- and 
lepton-number violating terms in Tv, e ff are 


r \AB\=l 

,Y,efF 


||_ £l j £ a/t 7 ( d t cr / 4 £. _ e^a^qiofju^a^ + h.c. 


2 Ml 

9b ij afa 

Mf 


{^qji){diu ] p) + ^u\){q ia qjg) 


+ h.c. 


(97.25) 


where the second line follows from a Fierz identity and a relabeling of 
the color indices in the second term. We can treat the 4> held similarly; 

1 We should also integrate out the heavy components of $, which transform as (8, 1, 0)® 
(1, 3, 0) © (1, 1, 0) under SU(3) x SU(2) x U(l), but these do not couple directly to quarks 
and leptons. 
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see problem 97.2. We will compute the decay rate for p — > e + 7r° from 
eq. (97.25) in problem 97.4. 

Once the heavy fields have been integrated out, we can apply the MS 
renormalization scheme to the theory with the remaining light fields. This 
is not the same as MS for the original theory, because now only light fields 
circulate in loops. (Loops of heavy fields contribute to corrections to the 
effective interactions.) With the usual fields of the Standard Model, we 
find from our results in sections 66 and 73 that the one-loop beta functions 
for the three gauge couplings are given by 

+ °(9i) , (97.26) 

with 


b 3 = -11 + | n , (97.27) 

&2 = -f + | n + i, (97.28) 

6i = +fn + i, (97.29) 

where n = 3 in the number of generations; the ±| contributions to 62 
and b\ are from the ip field. These formulae apply for p < M x . (We 
are assuming that the heavy scalars do not have masses much less than 
M x .) For p > M x , we must restore the heavy fields, and then eq. (97.14) 
applies. If we now neglect the higher-loop corrections, integrate eq. (97.26) 
for each coupling, impose eq. (97.14) at p = M x , and set 52 = e/ sin0 w and 
g 1 = e/cos 9 W , we find for p < M x that 


1 

a 3 (p) 


1 

OL 


+ In (Mx/jti) 


sin 2 9 W (p) 
a(p) 


1 

<25 (M x ) 


+ 7 ^ ln(M x / p) 


cos 2 9 w (p) 
a(p) 


5/3 

OL 5(Mx) 


+ In [My.jp) . 


(97.30) 

(97.31) 

(97.32) 


The quantities on the left-hand sides are measured at p = M z to be 


a 3 (M z ) = 0.1187 ±0.0020 . 
l/a(M z ) = 127.91 ±0.02 , 
sin 2 9 W (M Z ) = 0.23120 ± 0.00015 . 


(97.33) 

(97.34) 

(97.35) 
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We can use eqs. (97.30-97.32) to solve for 1/0:5 (M x ) and ln(M x /M z ) in 
terms of the known parameters a(M z ) and 0:3 (M z ); the result is 


1 


1 


+ 


&1+&2 
«3 (M z ] 

8/3 

^i+&2-|^3 \ot{M z ) a 3 (M z ] 


~b 3 

OgfAfx) ^l+&2 — §^3 V a (M z ) 
In (M x /M z ) 2?r / 1 


(97.36) 

(97.37) 


Plugging in eqs. (97.27-97.29) and eqs. (97.33-97.34), we find 1/0:5 (M x ) = 
41.5 and M x = 7x 10 14 GeV; two-loop corrections lower M x to 4x 10 14 GeV. 
This value of M x is about an order of magnitude below the lower limit 
imposed by proton decay. 

We can also use eqs. (97.30-97.32) to express sin 2 9 W (M Z ) in terms of 
a (M z ) and 03 (M z ); the result is 


sin 2 9 W (M Z ) 


1 

bi+b 2 -^b 3 



-63 + ( 61 -f &2) 


a(M z ) \ 
«3 (M Z )J 


(97.38) 


This is a prediction of the SU(5) model that we can test. Plugging in 
eqs. (97.27-97.29) and eqs. (97.33-97.34), we find sin 2 9 W (M Z ) = 0.207. 
Two-loop corrections raise this to 0.210T0.001. Comparing with eq. (97.35), 
we see that the SU(5) prediction is too low by about 10%. 

The situation improves considerably if we consider the Minimal Su- 
persymmetric Standard Model, discussed in section 96. In this case the 
beta-function coefficients become 


63 = -9 + 2 n, (97.39) 

b 2 = —6 + 2n + 1 , (97.40) 

6 1 = +^n+l. (97.41) 

Now we find sin 2 0 w (M z ) = 0.231, with two-loop corrections raising it to 
0.234; there are, however, numerous sources of uncertainty related to the 
masses of the supersymmetric particles. We also find M x = 2 x 10 16 GeV; 
this result (which is not changed significantly by two-loop corrections) is 
high enough to avoid too-rapid proton decay. 

Next, let us consider the predicted equality of the down quark and 
charged lepton masses, eq. (97.19). These relations are subject to renormal- 
ization; see problem 97.5. However, the one-loop corrections from gauge- 
boson exchange cancel in the predicted ratios 


m e 


m d 

m s 


m n _ m s 
m T mb 


m . 


(97.42) 
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Alas, these predictions are not satisfied, in the first case by an order of 
magnitude. Resolving this problem requires either a more complicated 
set of Higgs fields, and/or including higher-dimension, nonrenormalizable 
terms in the lagrangian that are suppressed by inverse powers of the some 
mass scale larger than M x , such as the Planck mass Mp = 1.2 x 10 19 GeV. 
To get neutrino masses in the SU(5) model (see section 91), we must add 
left-handed Weyl fields zb that are singlets of SU(5), and couple them to 
the neutrinos via £yuk = 

A more elegant scheme starts with SO (10) as the grand unified gauge 
group. SO(IO) has a complex sixteen dimensional spinor representation 
that transforms as 1 © 5 © 10 under the SU(5) subgroup; thus, each gen- 
eration of fermions (including 9) fits into a single Weyl field in this 16 
representation. A scalar field in the 10 representation is needed for the 
Standard Model Higgs field, and additional scalars in higher dimensional 
representations (such as the 45 dimesional adjoint representation and the 
16) are needed to break SO(IO) down to SU(3) x SU(2) x U(l). 

A great variety of grand unified models can be constructed, with and 
without supersymmetry. Which, if any, are relevant to the natural world is 
a question yet to be answered. 


Problems 


97.1) Is the gauge symmetry of the SU(5) model anomalous? If it is, modify 
the model to turn it into a consistent quantum field theory. Prereq- 
uisite: section 75. Hint: see problem 70.4. 

97.2) Compute the A B = ±1 terms in the effective lagrangian that arise 
from <f> exchange. 

97.3) Let us write eq. (97.25) as 

£ e ff = — Zc\C\0\ — Zc 2 C 2 0 2 + h.c. , (97.43) 

where £>1 = £^£ a ^(e i q jj )(dlu ^ ) and 0 2 = £ l h a ^{e^u\ ){q ia qjp), 
Z Cl and Zc 2 are renormalizing factors, and C\ and C 2 are coefficients 
that depend on the MS renormalization scale p. At p = M x , we have 
C’i(Mx) = C 2 (M x ) = 4 tt a 5 (M x )/M|. 

a) Working in Lorenz gauge, and using the results of problems 88.7 
and 89.5, show that the one- loop contribution to Zc\ from gauge- 
boson exchange is given by Z rn in spinor electrodynamics in Lorenz 
gauge, with 


(— l)(+l)e 2 fo + £ a 'P'\TZ) a , a (T%)//£ a ^ 


& 
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+ 

+ 


e <y (W(^V7e <J ' + 0 

(-3)(+s) + (+3)("§) 


92 


9i 


(97.44) 


Evaluate these coefficients. 

b) Similarly, compute the one- loop contribution to Zc 2 from gauge- 
boson exchange. 

c) Compute the corresponding anomalous dimensions 71 and 72 of C\ 
and C2. 

d) Compute the numerical values of C\(p) and C 2 (p) at p = 2GeV. 
For simplicity, take the top quark mass equal to M z , and all other 
quark masses less than 2 GeV. Ignore electromagnetic renormalization 
below M z . 

97.4) Consider the proton decay mode with the most stringent experimental 
bound, p — > e + 7r°. The terms in eq. (97.43) relevant for this mode are 

£ e ff = C'i£ a/37 (e« 7 )(Jf-uj 3 ) + 2C , 2£ a/j7 (e t uj y )(d Q up) + h.c. . (97.45) 

Note that, under the SU(2 )l x SU(2 )r global symmetry of QCD that 
we discussed in section 83, the operator u(d'u t) transforms as the 
first component of a (2, 1) representation, while the operator v) (du) 
is related by parity, and transforms as the first component of a (1,2) 
representation. At low energies, we can replace these operators (up to 
an overall constant factor) with hadron fields with the same properties 
under Lorentz and SU(2 )l x SU(2 )r x U(l)y transformations. 

a) Show that P^{uJ\f)\ and P R (u'Af)i transform appropriately. Here 
u = exp[*7r a T a // 7r ], where 7r a is the triplet of pion fields, and A/) is 
the Dirac field for the proton-neutron doublet. 

b) Show that the low-energy version of eq. (97.45) is then 

C e e = CiA£°P h (uJ\f)i + 2C 2 A£ G P n {v)M) l + h.c. , (97.46) 


where £ c is the charge conjugate of the Dirac field for the electron 
(in other words, £ G is the Dirac field for the positron), and A is a 
constant with dimensions of mass-cubed. Lattice calculations have 
yielded a value of A = 0.0090 GeV 3 for p = 2 GeV. 

c) Write out the terms in eq. (97.46) that contain the proton field and 
either zero or one 7r° fields. 

d) Compute the amplitude for p — > e + 7r°. Note that there are two 
contributing Feynman diagrams: one where eq. (97.46) supplies the 
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proton-positron-pion vertex, and one where the proton emits a pion 
via the interaction in eq. (83.30), and then converts to a positron via 
the no-pion terms in eq. (97.46). Neglect the positron mass. Hint: 
your result should be proportional to 1 + g A . 

e) Compute the spin-averaged decay rate for p — > e + 7r°. Use the 
values of C\ and C 2 for p = 2 GeV that you computed in problem 
97.3. How does your answer compare with the naive estimate we 
made earlier? Hint: your result should be proportional to C\ + 4(71. 

97.5) Consider the Yukawa couplings for the the down quark and charged 
lepton of one generation, 

£yuk = - Zyye^piije- Z y iy'£ lJ piq aj d a , (97.47) 

where we have included renormalizing factors. 

a) Consider one-loop contributions from gauge-boson exchange to 
Z y ! jZy. Show that the only contributions of this type that do not 
cancel in the ratio are those where the gauge boson connects the two 
fermion lines. 

b) Show that, in Lorenz gauge, these contributions to Z y / and Z y 
are given by Z m in spinor electrodynamics in Lorenz gauge, with a 
replacement analogous to eq. (97.44) that you should specify. 

c) Let r = y'/y, and compute the anomalous dimension of r. 

d) Take ?’(Af x ) = 1, and evaluate r(M z ). For simplicity, take the top 
quark mass equal to M z . 

e) Below M z , treat the top quark as heavy, and neglect the small elec- 
tromagnetic contribution to the anomalous dimension of r. Compute 
r(mfe), where mb = mbijUb) = 4.3 GeV is the bottom quark mass pa- 
rameter. Use your results to predict the tau lepton mass. How does 
your prediction compare with its observed value, m T = 1.8 GeV? 
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